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* (54) rule: DIAGNOSIS AND TREATMENT OF BLOOD DISORDERS 

(57) Abstract: Based on the discovery of the nucleotide and amino acid differences which distinguish the Gov" and Gov b allelic 
s forms of the membrane glycoprotein CD109, and which comprise the bi allelic Gov platelet alloantigen system, compositions and 
^ methods are provided for determining the Gov genotype and phenotype of individuals. Also provided, on the basis of this discovery, 
are compositions and methods for treating disorders associated with Gov alloantigen incompatibility, such as the bleeding disorders 
post-transfusion purpura, post-transfusion platelet refractoriness, and neonatal alloimmune thrombocytopenia. The two allelic forms 
Q of CD 109 differ by a single amino acid. The Gov" allelic form has Tyr at amino acid position 703 in the CD 109 sequence. The. Gov b 
^ allelic form has Ser at the same position. This amino acid difference is due to a single change, from A for the Gov 8 allele to C for 
^ the Gov 1 ' allele, in the CD109 gene. 
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DIAGNOSIS AND TREATMENT OF BLOOD DISORDERS 

FIELD OF THE INVENTION 

The present invention provides novel compositions and methods for use in diagnosing 
the occurrence of certain serious disorders, especially certain bleeding disorders, and novel 
compositions and methods for use in treating such a disorder, in a person in which the disorder 
has occurred, and novel compositions and methods for use in avoiding such a disorder, in an 
individual who is susceptible thereto. 

BACKGROUND OF THE INVENTION 

Among the disorders, which the invention concerns, are those involving abnormal and 
excessive bleeding due to destruction of blood platelets ("platelets"). 

These disorders include, but are not restricted to, post-transfusion purpura ("FTP") and 
post-transfusion platelet refractoriness ("PTPR"), which are suffered by some persons who 
receive blood, platelets, leukocyte concentrates, or plasma from other persons by transfusion or 
the like. 

The disorders also include one that is suffered by fetuses and newborns and is known 
as "neonatal alloimmune thrombocytopenia" ("NATP"). This disorder can cause death of fetuses 
and serious birth defects or death of newborns. NATP is estimated to affect about 1 in 1000 
newborns. In NATP, fetal platelets, which enter the mother's blood stream, induce production in 
the mother of antibodies directed against fetal platelets. These maternal antibodies then pass 
with the mother's blood into the fetus and mediate destruction of platelets in the fetus. 

A mother, whose fetus or newborn suffers from NATP, is at increased risk of suffering 
PTP or PTPR. 

When platelets from a first human (a "donor") are introduced into the blood system of a 
second human (a "recipient") by transfusion, through the placenta (in the case of fetal blood 
entering the mother), or the like, the recipient may mount an immune response against the 
platelets from the donor. Such an immune response is referred to as an "alloimmune" response, 
because it involves antibodies reacting against antigens of a different individual of the same 
species. The alloimmune response to platelets is due to an immune response of the recipient 
against "alloantigens" (antigens of the same species as that mounting the immune response) on 
platelets from the donor. These alloantigens are found on membrane glycoproteins that occur 
in the cell membranes, which define the outer surfaces of platelets ("platelet membranes"). In this 
invention, the glycoprotein is anchored to the membrane in an atypical manner through an anchor 
consisting of glycosylphosphatidylinositol (GPI), which anchors an extracellular domain or 
segment of the glycoprotein exposed to the outside of the platelet It is thought that 
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alloantibodies, which are generated in an alloimmune response against platelet alloantig ns, 
interact with the extracellular domains of the alloantigens. 

The platelet alloantigens that a person has are determined by the person's genetics. A 
donor, because of his or her genetics, may have a platelet alloantigen, which a recipient, who 
receives blood, platelets, leukocytes or plasma from the donor, does not have, because of the 
recipient's genetics. In such a situation, the immune system of the recipient may recognize the 
donor's alloantigen as "non-self," and raise an immune response against, the platelet alloantigen, 
which the donor has but the recipient does not. 

Membrane glycoprotein alloantigens have been characterised for both human red blood 
cells and human platelets. It is noteworthy, however, that they also occur on other cell types, such 
as leukocytes and endothelial cells, where they may also occasion various disorders through 
alloimmune responses. 

Recognised classes of red blood cell and platelet alloantigens have been described, over 
the past 30 years, based on observations of antibody reactions occurring when blood recipients 
have been exposed to blood from donors. 

A recent review of human platelet alloantigen systems is provided by Ouwehand, W., and 
Navarrete, C, in Molecular Haematology, Provan, D. and Gribben, J. eds. Blackwell (1999). 

Several biallelic platelet alloantigen "systems" have been characterised. In each of these 
systems, there are two alloantigens, each of which is provided by one of two alleles of the gene 
comprising the system. Because each gene occurs twice in the normal human genome, a person 
can be homozygous for one or the other of the two alloantigens, or heterozygous for the two 
alloantigens, comprising a biallelic system. The alloantigens described to date occur on 
glycoprotein molecules which may exist in various forms (transmembrane, GPHinked and 
soluble, for example). In such a case, the alloantigens are found on each of the variant forms of 
the glycoprotein. For all of the biallelic platelet alloantigen systems that have been characterised 
at the level of protein and gene sequences, it has been found in all cases, except for one, that 
the difference between the two alleles is based on a single nucleotide polymorphism in the 
relevant gene. 

One biallelic system of human platelet alloantigens is the Go^/Gov 6 biallelic system 
associated with CD109, a membrane glycoprotein which occurs on platelets and various other 
cell types, including leukocytes and endothelial cells. Each Gov allele corresponds to one CD109 

glycoprotein (Sutherland, D.R. et al, 1991; Smith et al. a 1995; Berry, J. et al., 2000), consistent 

» 

with the known tissue distribution of CD1 09. The frequencies for the Gov alleles are 0.4 for Gov 3 
and 0.6 for Gov 5 in the Caucasian population. Thus, in this population, 40.7% are heterozygous 
for the Gov alleles, and will not mount an alloimmune response due to Gov incompatibility (not 
possessing the Gov alloantigen found on platelets received from another). In contrast 19.8% of 
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Caucasians are homozygous for the Gov 3 allele and thus may mount an immune response due 
to Gov alloantigen incompatibility against platelets received from anyone in the 80.5% of the 
Caucasian population that is not homozygous for the Gov 3 allele, while 39.8% are homozygous 
for the Gov 6 allele and thus may mount an immune response due to Gov alloantigen 
incompatibility against platelets received from anyone in the 60.2% of the Caucasian population 
that is not homozygous for the Gov 13 allele. 

As indicated above, alloimmunization based on Gov incompatibility (the introduction into 
the blood stream of donor platelets bearing a Gov alloantigen not carried by the recipient) can 
result in bleeding disorders due to platelet destruction, including NATP, PTPR, and PTP. The 
location of the Gov antigens within the CD109 molecule, and the nature of the CD109 
polymorphism which underlies the Gov^Gov* alloantigen (both at the protein and at the gene 
level), have not heretofore been known. 

Furthermore, it has not heretofore been possible to generate non-human antibody 
(polyclonal or monoclonal), as from a rat mouse, goat, chicken, or the like, with specificity for the 
Gov 8 alloantigen but not the Gov 6 alloantigen (or vice-versa) sufficient for use in an 
immunoassay, for typing for Gov phenotype using platelets or CD109 molecules. 

Previously developed technology, involving gene-specific amplification of platelet RNA- 
derived cDNA, followed by the determination of the nucleotide sequence of the amplified DNA, 
has been applied successfully to the elucidation of the molecular basis of other biallelic platelet 
alloantigen systems (Newman et al., J. Clin. Invest 82,739-744 (1988); Newman et al., J. Clin. 
Invest 83, 1778-1781 (1989)(P1A or HPA-1 system); Lyman et at, Blood 75, 2343-2348 
(1990)(Bak or HPA-3system); Kuijpers et at, J. Clin. Invest 89, 381-384 (1992)(HPA-2 or Ko 
system); Wang et at, J. Clin. Invest 90, 2038-2043 (1992)(Pen system). With one exception, it 
has been found in each case that a single amino acid difference at a single position differentiates 
the amino acid sequences of the two alleles, and that this difference arises from a single allele- 
specific nucleotide substitution in the coding region of the mRNA and gene. There remains a 
need to elucidate the molecular basis of the biallelic Gov platelet alloantigen system. 

SUMMARY OF THE INVENTION 

The Gova/Govb CD109 single nucleotide polymorphism 

We have now discovered that a single amino acid difference in the CD109 glycoprotein 
distinguishes the Gov 3 and Gov 5 allelic forms. The two alleles differ at amino acid position 703 
of the fulNength 1445 .amino acid CD109 molecule, with the Gov 3 allele [SEQ ID NO:2] 
containing a Tyr at this position, whil the Gov 1 * allele [SEQ ID NO:4] contains Ser. 
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Further, we have discovered that this difference in amino acid sequence between the 
allelic forms of CD109 is due to a single nucleotide polymorphism at position 2108 of the coding 
portion of full-length mRNA encoding CD109, or of the corresponding coding strand of the cDNA 
corresponding to this mRNA. Specifically, the Gov 3 allele [SEQ ID NO:1] contains adenine at 
position 2108, the second nucleotide of the codon encoding the amino acid at position 703 of the 
full-length CD109 protein, while the Gov* allele contains cytosine at position 2108, as shown in 
SEQ ID NO:3 

The Gon/VGov 6 single nucleotide polymorphism of CD109, lies at position 2108 in SEQ 
ID NO:1 . SEQ ID NO:1 is the cDNA sequence encoding the full-length 1445 amino acid CD109 
precursor encoding the Gov 3 alleleln the Gov 6 allele form [SEQ ID NO:3], C occurs at position 
2108, rather than A. The ATG at the 5'-end of the sequence, in SEQ ID NO:1 corresponds to the 
translation start of the fulHength precursor form (including leader peptide) of CD109. The triplet 
corresponding to the N-terminal amino add of the mature CD109 protein is at positions 64-66 in 
SEQ ID NO:1. 

The Gov^Gov 6 single nucleotide polymorphism of CD109, lies at position 954 in SEQ ID 
NO:5. SEQ ID NO:5 is the genomic DNA sequence of human CD109 exon 19 and the 
contiguous introns, introns 18 and 19. The Go^/Gov 5 single nucleotide polymorphism of CD109 
is found within CD109 exon 19, and specifically is located at position 3 of CD109 exon 19. The 
sequence presented in SEQ ID NO:5 contains A at position 954, and thus corresponds to the 
Gov 3 allele. The corresponding Gov 5 sequence contains C at position 954 of SEQ ID NO:5 
(nucleotide position 3 of exon 19). 

In view of this discovery, it will be readily apparent to the skilled what the present 
invention provides: 

Gov allele-specific oligonucleotides and polynucleotides: Based on the discovery, the 
present invention provides oligonucleotides and polynucleotides (seems repetitive), including (but 
not limited to) probes which can be used to determine whether a person is homozygous for one 
or the other of the Gov alleles, or heterozygous for these alleles, thereby to determine that 
person's Gov genotype, and by extension, their Gov phenotype (i.e., the Gov alloantigen(s) which 
their cells express). Further, the invention provides methods of using such oligonucleotides, and 
test kits to facilitate their use, in such Gov genotype and phenotype determinations. These 
oligonucleotides of the invention can be used to determine whether, in the CD109 gene, or in the 
mRNA encoding CD109, the internal nucleotide (nucleotide 2108) of the codon (in CD109 gene 
or in the mRNA encoding CD109) which corresponds to the amino acid at position 703 in the 
sequence of fulHength CD109 is adenine or cytosine. Such probes will typically be cDNA but may 
be genomic DNA, mRNA or RNA, and may be labelled for detection. The oligonucleotides of the 
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invention can be used as probes to detect nucleic acid molecules according to techniques known 
in the art (for example, see US patent nos. 5,792,851 and 5,851,788). 

For example, an oligonucleotide of the invention may be converted to a probe by being 
end-labelled using digoxigenin-11-deoxyuridine triphosphate. Such probes may be detected 
immunologically using alkaline-phosphate-conjugated polyclonal sheep antidigoxigenin F(ab) 
fragments and nitro blue tetrazolium with 5rbromo-4-chloro-3-indoyl phosphate as chromogenic 
substrate. 

Gov allele-specific antibodies: Still further, based on the discovery, which underlies the 
invention, of the molecular basis for the Go^/Gov 5 alloantigen system, the invention provides 
non-human polyclonal and monoclonal antibodies, which can be used to distinguish one Gov 
allelic form of CD109 from the other, whether the CD109 is part of a complex embedded in or 
isolated from a membrane or is isolated. These antibodies of the invention, which are preferably 
provided in an aqueous buffer solution, and the immunoassays of the invention which employ 
such antibodies, are useful for determining whether a person has one or both of the Gov 
alloantigens and for Gov phenotyping. Methods of using the antibodies of the invention in the 
immunoassays of the invention, and in such determinations, are also encompassed by the 
invention. The invention also provides test kits to facilitate carrying out such immunoassays and 
determinations. 

Gov allele-specific peptides and polypeptides: Again, based on the discovery that underlies 
the invention, of the molecular basis for the Go^/Gov 6 alloantigen system, the invention provides 
peptides or polypeptides, which are useful for various purposes. These peptides or polypeptides 
are typically between 4 and 100, and more typically between 7 and 50, amino acids in length, and 
have amino acid sequences identical or having sequence identity to those of segments of the 
CD109 sequences, that include the amino acid at position 703 of full-length mature CD109. This 
amino acid (position 703) corresponds the triplet at positions 2107-2109 in the CD109 cDNA 
sequence presented in SEQ ID NO:1 , or in the corresponding sequence for the CD109 cDNA 
that encodes the Gov 1 " allelic form [SEQ ID NO:3]. These peptides or polypeptides may be 
synthetic, may be purified from native CD109 or may be prepared by recombinant means. For 
guidance, one may consult the following US patent nos. 5,840,537, 5,850,025, 5,858,719, 
5,710,018, 5,792,851, 5,851,788, 5,759,788, 5,840,530, 5,789,202, 5,871,983, 5,821,096, 
5,876,991, 5,422,108, 5,612,191, 5,804,693, 5,847,258, 5,880,328, 5,767,369, 5,756,684, 
5,750,652, 5,824,864, 5,763,211, 5,767,375, 5,750,848, 5,859,337, 5,563,246, 5,346,815, and 
W0971 3843. Many of these patents also provide guidance with respect to experimental assays, 
probes and antibodies, methods, transformation of host cells, which are described below. These 
patents, like all other patents, publications (such as articles and database publications) in this 
application, are incorporated by reference in their ntirety. 
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Gov allele-sp cific peptides and polypeptides as antfg ns and immunogens, and Gov 
allel -sp cific p lycl nal and monoclonal antib dies: These peptides or polypeptides are 
useful as antigens (usually coupled to a larger, immunogenic carrier [proteinaceous or otherwise], 
as known in the art) for making the polyclonal or monoclonal antibodies of the invention. The 
peptides or polypeptides are also useful in screening monoclonal antibody-producing cultures 
(hybridoma cultures/E. coli cultures or so-called V gene phage antibodies) to identify those that 
produce monoclonal antibodies of the invention. 

The invention also encompasses immunogenic compositions which comprise a peptide, 
polypeptide or fusion compound of the invention and which are immunogenic in a bird, including, 
without limitation, a chicken, or a mammal, such as, a mouse, rat, goat, rabbit, guinea pig, sheep 
or human. The compositions may include an immunogenicity-imparting "carrier" which may be 
but is not necessarily a protein as known in the art, that is immunogenic in a bird or mammal, 
coupled to at least one peptide or polypeptide of the invention, which has an amino add 
sequence that is the same as that of a segment of the sequence for CD109, that includes the 
amino acid at position 703 of the full length CD1 09 molecule. 

The present invention also provides methods of using the peptides, polypeptides and 
immunogenic compositions of the invention for making antibodies of the invention, and methods 
of using the peptides and polypeptides of the invention in screening monoclonal 
antibody-producing hybridoma cultures or bacterial clones for those that produce monoclonal 
antibodies or fragments thereof of the invention. 

Therapeutic and diagnostic application of Gov allele-specific peptides, polypeptides, and 
antibodies: These peptides or polypeptides, as well as antibodies, which are specific for the 
Gov 3 [SEQ ID NO:2] or Gov 5 [SEQ ID NO:4], but not both, allelic forms of CD1 09 in the platelet 
membrane, and which can be produced by a mammal (including an human) immunized with the 
peptides or polypeptides, which themselves happen to be immunogenic, or the immunogenic 
compositions of the invention, are also useful both therapeutically and diagnostically. The 
invention also provides the methods of using the peptides and polypeptides of the invention, and 
antibodies made using the peptides that are immunogenic and the immunogenic compositions 
of the invention, in therapeutic and diagnostic applications. 

The Gov allele-specific peptides or polypeptides can also be used diagnostically to detect 
the presence of Gov 3 or Gov 6 specific antibodies in human plasma or serum samples, using 
methods that are readily apparent to those skilled in the ait Such analyses would be useful in 
the investigation of cases of acquired alloimmune thrombocytopenia, including PTP, PTPR, and 
NATP. In the latter case, this approach could also be used to detect the presence of Gov ajlele- 
specific antibodies in the mother of th aff cted fetus or newborn. Th pres nee of Gov allele- 
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specific antibodies can also be detected using platelets of known Gov phenotype. However, this 
approach has numerous technical disadvantages that are eliminated by th use of Gov allele- 
specific peptides or polypeptides for Gov allele-specific antibody detection. 

Administration to a person, who is suffering from, or at risk for, for example, PTP or 
PTPR, or a mother at risk for passing NATP-causing allbantibodies to her fetus, of one of the 
peptides or polypeptides, that would be bound by the anti-Gov alloantibodies in such a person, 
would inhibit the binding of the alloantibodies to the person's (or the fetus's platelets and thereby 
inhibit the platelet destruction and abnormal bleeding associated with the disorders. Alternatively, 
administration to such a person of antibodies (particularly human antibodies), which are produced 
using a peptide or polypeptide of the invention, which is immunogenic by itself, or an 
immunogenic composition of the invention, and which are specific for the Gov allelic form of the 
CD109 on the person's platelets which is associated with the PTP or PTPR, from which the 
person is suffering or may suffer, would induce the production of anti-idiotypic antibodies, which, 
in turn, would inhibit the platelet-destructive effects of the anti-Gov alloantibodies, which are 
generated by the person's own immune system and which are causing or threatening to cause 
the PTP, PTPR or NATP. These therapeutic applications of peptides and polypeptides of the 
invention would be especially useful in treating NATP in a newborn, because the alloantibody 
giving rise to NATP in the newborn is not continuously produced by the immune system of the 
newborn, but rather is acquired passively, and therefore in limited, non-replenished quantity, by 
the newborn from its mother. 

Thus, in accordance with one aspect of the present invention, an oligonucleotideprobe 
is provided that hybridizes to a portion of the CD109 gene, or a portion of CD109-encoding 
mRNA or cDNA prepared from such mRNA, which portion includes a nucleotide corresponding 
to the internal nucleotide of the codon for the amino acid at position 703 of the lull-length CD109 
molecule, and that is capable of distinguishing one Gov allele from the other through the ability 
to hybridize under stringent conditions to the portion in question only when the nucleotide in 
question is A (or dA), when the probe is to detect the Gov 3 allele, or C (or dC), when the probe 
is to detect the Gov* allele. The nucleotide in question is at position 2108 of the coding region 
of the CD109 cDNA sequence and lies at position 2108 in SEQ ID NO:1. The cDNA sequence 
has A at this position, and so is the sequence corresponding to the Gov 3 allele. The nucleotide 
in question lies at position 954 of the sequence presented as SEQ ID NO:5 and contains an A 
in this position, and thus also corresponds to Gov 3 allele. 

The Gov allele-specific oligonucleotide hybridization probes of the invention may 
comprise genomic DNA, cDNA, or RNA, although preferably it is DNA. Such oligonucleotide 
probes can be synthesised by automated synthesis and will preferably contain about 10-30 
bases, although as understood in th oligonucleotide probe hybridization assay art, as few as 8 
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and as many as about 50 nucleotides may be useful, depending on the position within the probe 
where the potential mismatch with the target is located, the extent to which a label on the probe 
might interfere with hybridization, and the physical conditions (e.g., temperature, pH, ionic 
strength) under which the hybridization of probe with target is carried out 
In accordance with another aspect of the present invention, a test kit for Gov alloantigen typing 
is provided comprising: 

(a) means for amplifying nucleic acid that comprises at least a portion of a CD109 gene, a 
CD109-encoding mRNA, or a CD1 09 cDNA made from such RNA, wherein the portion includes 
a nucleotide (nucleotide 2108 in SEQ ID NO:1, or nucleotide 954 in SEQ ID NO:5) corresponding 
to the internal nucleotide of the codon encoding amino acid 703 of the full length CD109 protein. 

(b) an oligonucleotide probe of the invention, that distinguishes one Gov allele from the other. 
The "means for amplifying" will, as the skilled will readily understand, depend on the amplification 
method to be used. Thus, for example, these means might include suitable primers, a suitable 
DNA polymerase, and the four 2 , -deoxyribonucleoside triphosphates (dA, dC, dG, dT), if 
amplification is to be by the PCR method. To cite another example, if the amplification is to be 
by a method relying on transcription, such as the 3SR method, the means will include two 
primers, at least one of which, when made double-stranded, will provide a promoter, an RNA 
polymerase capable of transcribing from that promoter, a reverse transcriptase to function in 
primer-initiated, DNA-directed and RNA-directed, DNA polymerization and possibly also in 
RNAse H degradation of RNA to free DNA strands from RNA/RNA hybrids, the four 
ribonucleoside triphosphates (A, C, G and U), and the four 2'-<Jeoxyribonucleoside triphosphates. 
In another example, if the amplification is by the ligase chain reaction, the means will include two 
oligonucleotides (DNAs) and a suitable DNA ligase that will join the two if a target, to which both 
can hybridize adjacent to one another in ligatable orientation, is present 

The oligonucleotide probes of the invention will preferably be labelled. The label may be 
any of the various labels available in the art for such probes, including, but not limited to ^P; ^S; 
biotin (to which a signal generating moiety, bound to or complexed with avidin can be 
complexed); a fluorescent moiety; an enzyme such as alkaline phosphatase (which is capable 
of catalysing a chromogenic reaction); digoxigenin, as described above; or the like. 

As indicated in the examples, RFLP analysis can be employed, using BstNl (or 
isoschizomers thereof), in analysing cDNA or genomic DNA (with or without amplification) to 
determine Gov genotype. As indicated further in the examples, electrophoretic SSCP analysis 
may be used to determine Gov genotype. And as indicated in the examples, the hybridization 
studies outlined above may use fluorescent probes, and may be directly coupled to the DNA 
amplification step, as in "Real-Time PCR" or related methods. 
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There has also been provided, in accordance with another aspect of the present 
invention, a method of typing for Gov allele-specific target sequence in a CD 109 nucleic acid 
derived from a subject, comprising the steps of, 

(a) obtaining, by a target nucleic acid amplification process applied to mRNA from human 
platelets, endothelial cells, or T cells, an assayable quantity of amplified nucleic acid with a 
sequence that is that of a subsequence (or the complement of a subsequence) of the mRNA that 
encodes a CD109 said subsequence including the nucleotide at the position in the mRNA 
corresponding to position 2108 in SEQ ID NO:1 or to nucleotide 954 in SEQ ID NO:5; and 

(b) analyzing (e.g., in a nucleic acid probe hybridization assay employing an oligonucleotide 
probe or probes according to the invention) the amplified nucleic acid obtained in step (a) to 
determine the base or bases at the position in the amplified nucleic acid that corresponds to 
position 2108 in SEQ ID NO:1 or to nucleotide 954 in SEQ ID NO:5. It is noteworthy that, if the 
product of the amplification is double-stranded DNA, analysis for Gov genotype can be carried 
out by a RFLP (restriction fragment length polymorphism) analysis comprising exposing the 
amplified DNA to the restriction endonuclease BstNl (or isoschizomer thereof) under conditions 
whereby the DNA will be cleaved if it includes a site for cleavage by that enzyme. Such DNA, 
prepared from mRNA encoding the Gov 5 alloantigen, containing a C rather than an A at the 
position corresponding to nucleotide 2108 in SEQ ID NO:1 (or to nucleotide 954 in SEQ ID 
NO:5), includes a recognition site for that endonuclease, while such DNA prepared from mRNA 
encoding the Gov 3 alloantigen, does not If the analysis, by whatever method, of the amplified 
nucleic acid reveals that there is only an A (or dA) at the position corresponding to position 
12108, the platelets (and blood from which they came) have only the Gov 3 alloantigen, and the 
individual from whom the platelets came, is homozygous for Gov 3 . Alternatively, if the analysis 
of the amplified nucleic acid reveals that there is only a C (or dC) at the position corresponding 
to position 2108, the platelets (and blood from which they came) have only the Gov 6 alloantigen 
and the individual, from whom the platelets came, is homozygous for the Gov** allele. Finally, if 
the analysis indicates that there is either an A (or dA) or a C (or dC) at that position, the platelets 
(and blood from which they came) have both Gov alloantigens, and the individual from whom the 
platelets came, is heterozygous for Gov alloantigen. 

In one application of the typing methods of the invention, the methods are applied to two 
individuals to determine whether blood or platelets from one would provoke an alloimmune 
response, and possibly FTP or PTPR, in the other. The typing method can be applied with a man 
and a woman, who are contemplating conceiving or have conceived a child together, to 
determine the risk that the child would be at risk for NATP and the risk that the woman would be 
at increased risk for PTP or PTPR. If the woman were heterozygous for the Gov alloantigens 
there would be, due to Gov alloantigen incompatibility, no risk of NATP and no increased risk for 
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the woman of PTP or PTPR. If, however, the woman were homozygous for one of the Gov 
alloantigens, there would be, due to Gov alloantigen incompatibility, risk of NATP in a child and 
increased risk of PTP or PTPR for the woman, unless the man is homozygous for the same Gov 
alloantigen as is the woman. 

In accordance with yet another aspect of the present invention, a method of typing an 
individual for Gov alloantigen is provided that comprises analyzing the genomic DNA of the 
individual to determine the Gov alloantigen(s) of the individual. Applications of this method are 
substantially the same as those of the method of the invention for typing for Gov alloantigen that 
begins with platelet, endothelial cell, or T cell mRNA. 

This method of the invention, entailing analysis of genomic DNA, can be earned out in 
substantially the same way as outlined above for analysis of mRNA, namely first amplifying the 
genomic DNA and then analyzing to product of the amplification to ascertain whether there is only 
dA, only dC, or both dA and dC, at the position in the coding region of the genomic DNA 
corresponding to position 2108 in SEQ ID NO:1, or to nucleotide 954 in SEQ ID NO:5. 

In accordance with a further aspect of the present invention, a test kit for Gov alloantigen 
typing is provided comprising a non-human antibody (or antibodies) that distinguishes the two 
allelic forms of CD109. The antibody (or antibodies) of the kit may be polyclonal, or preferably 
monoclonal, and in addition to its (their) specificity for either but not both Gov alloantigens (on 
the surface of platelets or separated therefrom) or the CD1 09 subunit of one but not both of such 
alloantigens, typically will recognise a polypeptide molecule encoded by a nucleotide sequence 
encoding at least amino acid 703 of a CD109 polypeptide (the amino acid at the position 
corresponding to nucleotides 2107 - 2109 in SEQ ID NO:1 , or to nucleotides 953 - 955 in SEQ 
ID NO:5). 

DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

The following definitions are provided in order to aid those skilled in the art in 
understanding the detailed description of the present invention. 

The term "alloantigens" refers to antigens of an individual that are responsible for eliciting 
an alloimmune response. 

The phrase "alloimmune response" refers to an immune response, which occurs when 
antibodies from one individual react against antigens of a different individual of the same species. 

The phrase "anti-idiotypic antibodies" refers to antibodies which can bind endogenous or 
foreign idiotypic antibodies and which can be used to treat or prevent pathological conditions 
associated with an immune response to a foreign alloantigen. 

The phrase "Gov'VGov* biallelic system" refers to a system of human platelet alloantigens 
in which an individual can be homozygous for either Gov 8 or Gov 5 allelic forms of CD 109, or an 
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individual can be Gov^Gov* heterozygous for CD109. 
"GPI" refers to glycosylphosphatidylinositol. 
The term "NATP" refers to neonatal alloimmune thrombocytopenia, 
"Nucleic acid" includes DNA and RNA, whether single or double stranded. Th term is 

also intended to include a strand that is a mixture of nucleic acids and nucleic add analogs 

and/or nucleotide analogs, or that is made entirely of nucleic acid analogs and/or nucleotide 

analogs. 

"Nucleic acid analogue" refers to modified nucleic acids or species unrelated to 
nucleic acids that are capable of providing selective binding to nucleic acids or other nucleic 
acid analogues. As used herein, the term "nucleotide analogues" includes nucleic acids 
where the intemucleotide phosphodiester bond of DNA or RNA is modified to enhance bio- 
stability of the oligomer and "tune" the selectivity/specificity for target molecules (Ulhmann, et 
al y 1990, Angew. Chem. Int Ed. Eng., 90: 543; Goodchild, 1990, J. Bioconjugate Chera, I: 
165; Englisch et a/., 1991, Angew, Chem. Int. Ed. Eng., 30: 613). Such modifications may 
include and are not limited to phosphorothioates, phosphorodithioates, phosphotriesters, 
phosphoramidates or methylphosphonates. The 2'-0-methyl, allyl and 2'-deoxy-2'-fluoro RNA 
analogs, when incorporated into an oligomer show increased biostability and stabilization of 
the RNA/DNA duplex (Lesnik et al., 1993, Biochemistry, 32: 7832). As used herein, the term 
"nucleic acid analogues" also include alpha anomers (a-DNA), L-DNA (minor image DNA), 2'- 
5' linked RNA, branched DNA/RNA or chimeras of natural DNA or RNA and the above- 
modified nucleic adds. For the puposes of the present invention, any nucleic acid containing 
a "nucleotide analogue* shall be considered as a nucleic acid analogue. Backbone replaced 
nucleic add analogues can also be adapted to for use as immobilised selective moieties of 
the present invention. For purposes of the present invention, the peptide nucleic adds 
(PNAs) (Nielsen et al, 1993, Anti-Cancer Drug Design, 8: 53; Engels et al., 1992, Angew, 
Chem. Int. Ed. Eng., 31: 1008) and carbamate-bridged morpholino-type oligonucleotide 
analogs (Burger, D.R., 1993, J. Clinical Immunoassay, 16: 224; Uhlmann, et al., 1993, 
Methods in Molecular Biology, 20,. "Protocols for Oligonucleotides and Analogs," ed. Sudhir 
Agarwal, Humana Press, NJ, U.S.A., pp. 335-389) are also embraced by the term "nucleic 
acid analogues". Both exhibit sequence-spedfic binding to DNA with the resulting duplexes 
being more thermally stable than the natural DNA/DNA duplex. Other backbone-replaced 
nucleic acids are well known to those skilled in the art and may also be used in the present 
invention (See e.g., Uhlmann et al 1993, Methods in Molecular Biology, 20, "Protocols for 
Oligonudeotides and Analogs," ed. Sudhir Agrawal, Humana Press, NJ, U.SA, pp. 335). 

The term "PTP" refers to post-transfusion purpura. 

The term "PTPR" refers to post-transfusion platelet refractorines. 
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n SNP n refers to single nucleotide polymorphism. 

Th standard, one-letter codes "A," "C, n "G, " and "T are used herein for the nucleotides 
adenylate, cytidylate, guanylate, and thymidylate, respectively. The skilled will understand that, 
in DNAs, the nucleotides are ^'-deoxyribonucleotide-S-phosphates (or, at the 5*-end, possibly 
triphosphates) while, in RNAs, the nucleotides are ribonucleotide-S'-phosphates (or, at the 5'-end, 
possibly triphosphates) and uridylate (U) occurs in place of T. "N" means any one of the four 
nucleotides. On occasion herein, dA, dC, dG and dT might be used for the respective 
2'-deoxyribonucIeotides. 

Unless otherwise specified or required by the context, "nucleic add" means DNA or RNA 
and "nucleotide" means ribonucleotide or 2 , -deoxyribonucleotide. 

Reference herein to a "full-length" CD109 molecule or protein means the 1445-amino 
acid-long polypeptide, for which the amino acid sequence, deduced from a cDNA sequence, is 
provided in SEQ ID NO:1 and in SEQ ID NO:3 and which is denoted as the full-length translated 
product (i.e., including the amino-terminal leader peptide, and excluding carboxyl-terminal 
processing associated with GPI anchor addition). The Gov 3 alloantigen bearing form of CD109 
may be referred to herein as ^yr CD109. The Gov 5 alloantigen bearing form of CD109 may be 
referred to herein as 703 Ser CD109. 

It has been determined that a single nucleotide of the CD109 gene is responsible for the 
Gov polymorphism in CD109. Extensive serological studies initially demonstrated that the 
polymorphism underlying the Gov system resides solely on the CD109 molecule [Sutherland, 
D.R. (1 991 ); Smith et ai. (1 995)]. Further, extensive deglycosylation of CD1 09 does not affect the 
binding the anti-Gov 8 and anti-Gov 6 antibodies to molecules of the appropriate phenotype, or to 
cells bearing the appropriate CD109 variant, indicating that carbohydrate residues are not 
involved in the formation of Gov antigenic epitopes. Further work has indicated that the Gov 
allele-specific antibody binding can however, be abrogated by denaturation of CD109 with the 
detergent SDS [Smith et al. (1995)]. Taken together, these observations indicate that the Gov 
alleles of CD109 are protein epitopes that are likely defined by the primary amino acid sequence 
ofCD109. 

Following the isolation of a CD109 cDNA the nature of the two Gov alleles was 
characterised further using platelet RNA-derived cDNA in the polymerase chain reaction ("PCR"). 
Platelet mRNA transcripts were obtained from serologically defined Gov 873 , Gov 8 * and Gov* 5 
individuals. The RNA was then converted to cDNA, and the entire CD109 cDNA coding region 
was then amplified as a series of overlapping PCR products. The Gov 3 [SEQ ID NO:1] and Gov 6 
[SEQ ID NO:3] alleles differ by an A to C substitution at position 2108 of the coding region of the 
CD109 cDNA. This single nucleotide polymorphism also results in a BstNl restriction site in the 
Gov 5 allele that is not present in its Gov 8 counterpart On the basis of this BstNl site, Gov 8 can 
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by distinguished from Gov 1 ' by restriction fragment length polymorphism (RFLP) analysis. This 
single nucleotide polymorphism can also be detected by SSCP analysis, and by allele-specific 
hybridization studies, including "Real-Time" PCR analyses. 

As a result of this A 2108 C single nucleotide polymorphism, the Gov 3 all le [SEQ ID NO:2] 
of CD109 contains a Tyr at position 703 of the full-length protein, while the Gov 5 allele [SEQ ID 
NO:4] contains a Ser in this position. The polymorphism does not alter the ability of Gov 3 and 
Gov 6 homozygous platelets to adhere to collagen types I, III and V. Additionally, the binding of 
anti-Gov 3 and anti-Gov* antibodies to platelets of the appropriate phenotype did not interfere with 
platelet adhesion to any of the above collagen types. Thus, while the Tyr^Ser results in the 
formation of the Gov alloantigen epitopes, it does not appear to impair platelet function. 

Identification and characterisation of the Gov alloantigen system permits pre- and 
post-natal diagnosis of the Gov phenotype of an individual, providing a warning for the possibility 
of NATP, PTP and PTPR. Allelic Gov typing of CD1 09 equates with the Gov status of the CD1 09 
protein of an individual. The Gov system led to diagnostic and therapeutic strategies to avoid or 
control diseases that result from Gov incompatibility. The present invention can be applied to 
these tasks and goals in a variety of ways, illustrative examples of which are discussed below. 

For example, an oligonucleotide probe can be synthesized, in accordance with the 
present invention, that will hybridize to a cDNA segment, derived from CD109 mRNA, that 
contains the nucleotide G at polymorphic nucleotide 2108 (nucleotide=guanylate). Alternatively, 
an oligonucleotide probe can be synthesized that will hybridize with a CD109 cDNA segment 
containing the base adenine at nucleotide 2108. (nucleotide=adenylate). These allele-specific 
probes can be appropriately labelled and added to the generated cDNA segments under 
annealing conditions, such that only one of the allele-specific probes hybridizes and can be 
detected, thereby identifying the specific Gov 8 or Gov 6 allele. In accordance with conventional 
procedures, the design of an oligonucleotide probe according to the present invention preferably 
involves adjusting probe length to accommodate hybridization conditions (temperature, ionic 
strength, exposure time) while assuring allele-specificity. A length often to thirty nucleotides is 
typical. 

Diagnostic kits can also be used, in accordance with the present invention, for the 
determination and diagnosis of alloantigen phenotypes via the procedures described herein. 
Such a kit can include, among others, , antibodies or antibody fragments to an antigenic 
determinant expressed by either of the above-described Gov 3 - and Gov 5 - encoding sequences. 
These antibodies would react with the blood sample of an individual so as to indicate whether 
that individual has a Gov 3 or Gov 5 phenotype. Alternatively, all the reagents required for the 
detection of nucleotide(s) that distinguish the Gov alloantigens, by means described herein, can 
be provided in a single kit that us s isolated genomic DNA, platelet (or other cellular) mRNA or 



13 



WO 02/070738 



PCT/CA02/00291 



total RNA, or corresponding cDNA from an individual. A kit containing a labelled probe that 
distinguishes, for example, nucleotide 2108 of CD109 can be utilised for Gov alloantigen 
genotyping and phenotyping. 

A further beneficial use of the nucleotide sequences that distinguish the Gov 3 allele from 
the Gov*' allele is to obtain or synthesize the respective expression product, in the form of a 
. peptide or polypeptide, encoded by these nucleotide sequences. These polypeptides can be 
used to generate antibodies for diagnostic and therapeutic uses, for example, with regard to 
pathological conditions such as PTP, PTPR or NATP. These polypeptides can also be used 
diagnostically to detect the presence of Gov 3 or Gov 5 specific antibodies in patient plasma or 
serum, or used therapeutically (see below; assays may be adopted, for example, from US Patent 
No. 5,851 ,788). 

A polypeptide within the present invention which can be used for the purpose of 
generating such antibodies preferably comprises an amino-acid sequence that corresponds to 
(i.e., is coincident with or functionally equivalent to) a fragment of the CD109 molecule that 
includes amino acid 703. When amino acid 703 is Tyrosine, the polypeptide can be used, as 
described above, to produce antibodies that specifically bind the Gov 3 form of CD109; in contrast, 
when it is Serine, antibodies can be obtained that specifically recognise the Gov 5 form. The class 
of polypeptides thus defined, in accordance with the present invention, is not intended to include 
the native CD109 molecule, but does encompass fragments of the molecule, as well as synthetic 
polypeptides meeting the aforementioned definition. 

Although the length of a polypeptide within this class is not critical, the requirement for 
immunogenicity may require that the polypeptide be attached to an immunogenicity-imparting 
carrier. Such earners include a particulate earner such as a liposome or a soluble macromolecule 
(protein or polysaccharide) with a molecular weight in the range of about 10,000 to 1,000,000 
Daltons Additionally, it may be desirable to administer the polypeptide with an adjuvant, such as 
complete Freund's adjuvant For artificial polypeptides, as distinguished from CD109 fragments, 
maximum length is determined largely by the limits of techniques available for peptide synthesis, 
which are currently about fifty amino acids. Thus, a synthetic polypeptide of the present invention 
is preferably between four to about fifty amino acids in length. 

In the context of the present invention, the term "antibody" encompasses monoclonal and 
polyclonal antibodies produced by any available means. Such antibodies can belong to any 
antibody class (IgG, IgM, IgA, etc.) and may be chimeric. Examples of the preparation and uses 
of polyclonal antibodies are disclosed in U.S. Patent Nos. 5,512,282, 4,828,985, 5,225,331 and 
5,124,147 which are incorporated by reference in their entirety The term "antibody" also 
encompasses antibody fragments, such as Fab and F(ab')2 fragments, of anti-Gov 3 or anti-Gov 5 
antibodies, conjugates of such fragments, and so-called "antigen binding proteins" (single-chain 

14 



WO 02/070738 



PCT/CA02/00291 



antibodies) which are based on anti-Gov 3 or anti-Gov 5 antibodies, in accordance, for example, 
with U.S. Pat No. 4,704,692, the contents of which are hereby incorporated by reference. 
Alternatively, monoclonal antibodies or fragments thereof within the present invention can be 
produced using conventional procedures via the expression of isolated DNA that encodes 
variable regions of such a monoclonal antibody in host cells such as E. coli (see, e.g„ Ward et 
al., Nature, 341:544-546 (1989)) or transfected murine myeloma cells (see Gillies et al., 
Biotechnol. 7:799-804 (1989); Nakatani et al., Biotechnol. 7:805-810 (1989)). For additional 
examples of methods of the preparation and uses of monoclonal antibodies, see U.S. Patent 
Nos. 5,688,681, 5,688,657, 5,683,693, 5,667,781, 5,665,356, 5,591,628, 5,510,241, 5,503,987, 
5,501,988, 5,500,345 and 5,496,705 that are incorporated by reference in their entirety. 

While human alloantisera currently used for serological typing are specifically excluded 
from this definition, the use of CD1 09 or Gov allele-specific peptides to detect anti-Gov antibodies 
in human plasma or serum, or to determine the specificity of such alloantibodies, are specifically 
included. Similarly, the use of such CD109 peptides or Gov allele-specific peptides to purify 
CD109 antibodies, or allele-specific CD109 antibodies from human serum is specifically included. 
Similarly, the use in vitro of such CD109 peptides or Gov allele-specific peptides to deplete allele- 
specific antibody activity from human serum samples, or to block CD109 antibody binding, or 
allele-specific antibody binding, is specifically included. 

Diagnostic applications of these antibodies are exemplified, according to the present 
invention, by the use of a kit containing an anti-Gov 3 or an anti-Gov 5 antibody, which undergoes 
a reaction with a sample of an individual's blood to determine a Gov 3 or Gov 6 platelet phenotype. 
Such a reaction involves the binding of anti-Gov 3 antibody to Gov 3 antigen or the binding of 
anti-Gov 5 antibody to Gov 6 antigen. The observation of antibody-antigen complex in a blood 
sample would indicate a positive result A kit of this type could be used to diagnose, or to help 
prevent the occurrence of pathological conditions like FTP, PTPR, or NATP. 

A polypeptide of the present invention that is recognised specifically by anti-Gov 3 or 
anti-Gov 5 antibodies can also be used therapeutically. Thus, antibodies raised against such a 
polypeptide can be employed in the generation, via conventional methods, of anti-idiotypic 
antibodies, that is, antibodies that bind an anti-Gov 3 or anti-Gov 5 antibody. See, e.g., U.S. Pat 
No. 4,699,880, the contents of which are hereby incorporated by reference. Such anti-idiotypic 
antibodies would bind endogenous or foreign anti-Gov antibodies in the blood of an individual, 
which would treat or prevent pathological conditions associated with an immune response to a 
"foreign" Gov alloantigen. Alternatively, a polypeptide within the present invention can be 
administered to an individual, with a physiologically-compatible carrier, to achieve the same 
qualitative effect, namely, the selective reduction or elimination of circulating anti-Gov antibodies 
from a patient suffering or at risk from an immune response, or th abrogation by competitive 
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binding to administered peptide, of the binding of Gov-specific antibodies to the platelets of such 
an individual 

The present invention is further described below by reference to the following, illustrative 
examples. 

EXAMPLE 1 : PCR Amplification and Analysis of PCR Products 

Platelet total RNA was isolated from EDTA anticoagulated blood of Gov 33 and Gov 66 
individuals in the manner described in Lymann et a)., Blood 75:2343-48 (1990). First, platelet 
mRNA in 10 ^l aliquots was heated to 70°C for 10 minutes and quickly cooled on ice before 
reverse transcription. The first strand cDNA was then synthesized using 10 oligo dT, 40 units 
RNAsin (Promega), 2 mM of each dNTP (dN triphosphate) (Pharmacia), 500 units of cloned 
MMLV reverse transcriptase and 5x enzyme buffer (Gibco) in a total volume of 50 jiL The cDNA 
synthesis was carried out at 42°C for 45 minutes and was stopped by chilling to 0°C. 

Overlapping sets of oligonucleotide primers (Table 2) based on the sequence of CD109 
were then used to amplify by PCR the entire coding region of platelet CD109 in 8 overlapping 
segments that spanned the entire CD109 open reading frame. 



TABLE 2: 



Fragm 
ent 


Sense Primer 


Antisense Primer 




Size 
(bp) 


Annee 
ng 
Tempi 
atun 
(°C) 


1 


K1-80 

5* GTAGCCCAGGCAGACGCC 3' 


(-24) 


K1-650 
5' 

GTGACAACCACTGTTGGATCAA 
3 


544 


568 


59 


2 


K1-1 

5 1 CGCATTGTTACACTCTTCTC 3' 


445 


K1-1120 

5' TACATTTCTTGAAATACCTG 3 


1014 


570 


50 


3 


K1-1022 

S GATTCTTCAAATGGACTTT 3 


910 


K1-REV-1 

5' GGCTGTGTCACAGAGATC 3 


1747 


838 


50 


4 


K1-1400 

ff TGAATTCCCAATCCTGGAGGA 
3 


1291 


GSP3 

5' GCCACCCAAGAAGTGATAGA 
3 


2165 


875 


55 


5 


K1-M43 

5" TTCAGGAATGTGGACTCTGG 
3 


1898 


6R4N 

CGGCTTCAAGGAAACATCT 3 


2998 


1101 


56 
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6 


K1-3080 

5" CTGGGAGCACTTGGTTGTCA 
3' 


2948 


1-5N 
5* 

CAGCAACATCTAAATCAAAGGC 
3' 


3859 


912 


56 


7 


K1-3570 

5 ACAATTTCAGACTTCTGAGG 
3' 


3462 


7U3N 

5 OACAGCCAAAbMUOAIAo 


4337 


876 


50 


8 


K1-3920 

5' GACGAAGATCTATCCAAAATC 
3' 


3812 


K1-4600 

5* GCTAGGACCTGTTGTACACC 
3' 


4489 


678 


55 



Table 2 lists the position of the 5' end of each oligonucleotide with respect to the CD1 09 
cDNA sequence, which includes both 3' and 5* untranslated regions, is noted in parentheses. 
The CD109 ORF encompasses nucleotides 1-4335 of the published CD109 cDNA, and 
corresponds exactly to the CD109 cDNA sequence presented in SEQ ID NO:1. The size of each 
PCR product, and the annealing temperature used for the corresponding primer pair, is listed. 

PCR reactions (50 :l) containing 1x PCR buffer (Gibco Life Technologies), 1.5 mM MgCfe, 
200 :M of each dNTP, 1 :M of each primer, 1 25 units Taq polymerase (Gibco Life Technologies), 
and 3 :l cDNA underwent 40 cycles of 94°C (45 seconds), primer-specific annealing temperature 
(Table 2; 45 seconds), and 72°C (45 - 60 seconds), using a Perkin Elmer 2400 thermocycler. 
PCR products (30 :l) were subsequently size-separated electrophoretically on a 1.2 % 
agarose/TAE gel containing 1 :g/ml ethidium bromide. Bands were subsequently excised and 
purified (50 :l) using the QIAquick (Qiagen) kit for direct sequencing and subcloning. Sequencing 
reactions (3-5 :l purified product per reaction) were carried out using the Thermosequenase 
Cy5.5 dye terminator sequencing kit (Amersham Pharmacia Biotech) and the same primers that 
had been used for initial PCR amplification (Table 2), or selected internal CD109-spedfic primers 
as appropriate, and were subsequently analysed using the Open Gene automated DNA 
sequencing system (Visible Genetics). In parallel, PCR products were cloned into PmeWigested 
pMAB1, a pBS SK(-) (Stratagene) derivative containing a Pmel restriction site within the 
polylinker. Resultant plasmid clones were analysed by alkaline lysis/restriction digestion, and as 
appropriate (and following an additional overnight 13% PEG/ 1.6 M NaCI. precipitation), by DNA 
sequence analysis as above. By combining direct PCR sequencing and the analysis of subcloned 
fragments, it was ensured that the DNA sequence of each PCR-derived cDNA fragment was 
obtained independently at least twice, with each fragment being sequenced in both directions in 
its entirety. 

This analysis revealed that the CD109 cDNA sequences of Gov 33 and Gov 1 * individuals 
differed by a single nucl otide at position 2108 of the sequence shown in SEQ ID NO:1 . Gov 2 * 3 
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individuals have an A at position 2108, whereas Gov individuals have a C at the same position. 
This change results in a Tyr-Ser amino acid polymorphism at residue 703 of the full-length 
CD109 polypeptide chain. This single nucleotide polymorphism also results in a BstNl restriction 
site in the Govb allele that is not present in the Gov 3 allele. Analysis of the other regions of the 
CD109 cDNA in their entirety revealed no other nucleotide differences that segregated with Gov 
phenotype p.e., that could be used to distinguish the Gov 8 allele from the Gov 5 allele). 

To facilitate subsequent genomic DNA analyses of the Gov** alleles, the intron/exon 
junctions of the exon bearing the putative Gov-specific nucleotide substitution identified above, 
as well as the DNA sequence of the flanking introns, were determined. CD109 cDNA-specific 
oligonucleotides binding in the vicinity of this substitution were used for the direct sequencing of 
p4L10, a pCYPACJ -derived PAC clone bearing the human CD109 locus using the Open Gene 
system (Visible Genetics) as above. The nucleotide sequence of the Gov polymorphism- 
containing exon, as well as of the flanking introns, is presented in SEQ ID NO:5. The Gov 
polymorphism lies at nucleotide position 954 in SEQ ID NO:5. Subsequent work has mapped the 
intron-exon structure of the entire human CD109 locus, and has determined that the Gov single 
nucleotide polymorphism of CD109 lies in exon 19 of the CD109 gene. 

EXAMPLE 2: RFLP Analysis of PCR Amplified Genomic DNA 

The A-C Gov CD109 polymorphism corresponds to the internal nucleotide of the first 
complete codon of exon 19 of the CD109 gene. As this exon comprises only 118 nucleotides, 
and the Gov polymorphism lies almost at the extreme 5' end of this exon, we determined the 
nucleotide sequence of both introns flanking this exon to facilitate subsequent genomic DNA 
analyses of the Gov** alleles. The DNA sequence of CD109 exon 19 and its flanking introns 
(CD109 introns 18 and 19) is presented as SEQ ID NO:5. To confirm that the A to C 
polymorphism at position 2108 of the CD109 open reading frame (nucleotide 2108, SEQ ID NO: 
1 ; nucleotide 954, SEQ ID NO:5) segregates with the Gov phenotype, RFLP analysis was earned 
out on PCR amplified genomic CD109 DNA using the BstNl restriction endonuclease, which 
recognises the DNA sequence 5* CCAGG 3' found in the Gov 6 cDNA (nucleotides position 2108 

- 21 12 in SEQ ID NO:3; the corresponding Gov 8 sequence, 5' ACAGG 3\ is nucleotides 2108 

- 2112 in SEQ ID NO:1). This enzyme does not cleave at 5' ACAGG 3' (found in Gov 8 ; 
nucleotides 2108-2112 in SEQ ID NO: 1). A 448 bp genomic fragment was PCR-amplified from 
Gov 88 , Gov 86 , and Gov 1 * individuals using the pair of oligonucleotides SEQ ID NO:9 and SEQ ID 
NO: 10. These oligonucleotides flank exon 19. The former binds within intron 18 (nucleotides 875 

- 892 SEQ ID NO:5), while the latter binds within intron 19 to the sequence complementary to 
nucleotides 1305 - 1322 of SEQ ID NO:5). The resultant 448 bp PCR product, when digested 
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with BstNl, yielded the restriction fragments predicted on the basis that the A to C polymorphism 
at position 2108 (SEQ ID NO: 1) segregates with the Gov phenotype. 

EXAMPLE 3: Hybridization Analysis of PCR Amplified Genomic DNA 

To further confirm that the A to C polymorphism at position 2108 of the CD109 open 
reading frame (nucleotide 2108, SEQ ID NO:1; nucleotide 954, SEQ ID NO:5) segregates with 
the Gov phenotype, we also performed an alternative analysis involving the selective 
hybridization of Gov allele-specific DNA probes to PCR amplified genomic CD109 DNA. Two 
primers flanking the polymorphic A-C site at position 2108 (SEQ ID NO:1; position 954, SEQ ID 
NO:5) were designed to amplify by PCR a 105 bp genomic DNA fragment containing the 
polymorphic site from genomic DNA isolated from Gov 33 , Gov 5 *, and Gov 1 * individuals. The first 
primer (SEQ ID NO:1 1) binds within intron 18 to nucleotides 902 - 928 of SEQ ID NO:5. The 
second primer (SEQ ID NO:12) binds within exon 19 to the sequence complementary to 
nucleotides 977 - 1 106 of SEQ ID NO:5. Two additional nucleotide probes were designed - one 
specific for the target sequence of the Gov 3 allele of the CD1 09 gene, and the other for the Gov** 
allele of the CD109 gene. The first probe (SEQ ID NO:13) overlaps the CD109 intron 18/exon 
19 junction, binds to the Gov 3 allele at nucleotides 935 - 974 of SEQ ID NO:5, and was tagged 
with the fluorescent dye 6-FAM. The second probe (SEQ ID NO:14), also overlapping the CD1 09 
intron 18/exon 19 junction, binds to the Gov 5 allele at the position corresponding to nucleotides 
935 - 971 of SEQ ID NO:5, and was tagged with the fluorescent dye VIC. Genomic DNA was 
isolated from Gov phenotyped human peripheral blood leukocytes, and PCR/hybridization 
analysis was carried out using Taqman real-time PCR technology (Perkin Elmer). Genomic DNA 
was amplified using primers SEQ ID NO:11 and SEQ ID NO:12, with each reaction additionally 
containing 100 nM FAM-labelled Gov 3 probe and 200 nM VIC-labelled Gov 5 probe. Allelic 
discrimination, based on allele-specific fluorescence, was then determined using a post-PCR 
plate reader (Perkin Elmer). In all cases, PCR/fluorescence-based Gov genotyping correlated 
with the Gov phenotype, indicating that the A to C polymorphism at position 2108 (SEQ ID NO: 
1) does indeed segregate with the Gov phenotype. 

EXAMPLE 4: SSP Analysis of PCR Amplified Genomic DNA 

To further confirm that the A to C polymorphism at position 2108 of the CD109 open 
reading frame (nucleotide 2108, SEQ ID NO:1; nucleotide 954, SEQ ID NO:5) segregates with 
the Gov phenotype, we also performed an alternative analysis involving SSCP analysis of PCR 
amplified genomic CD109 DNA. Two Gov allele-specific antisense oligonucleotides - SEQ ID 
NO:6 and SEQ ID NO:7 - differing by a single 3' nucleotide (and binding to sequence 
complementary to nucleotides 954 - 976 of SEQ ID NO:5, and of the Gov 6 counterpart of SEQ 
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ID NO:5, respectively), were combined with a common sense primer - SEQ ID NO:8 binds within 
intron 18 and which corresponds to nucleotides 752 - 773 of SEQ ID NO:5, to amplify a 225 bp 
genomic DNA fragment containing the Gov polymorphic site from genomic DNA isolated from 
Gov 33 , Gov 36 , and Gov 1 * individuals. In all cases, complete concordance between PCR-SSP 
analysis and Gov phenotyping was observed. 

SEQUENCES: 

SEQ ID NO: 1 consists of the entire 4335 nucleotide CD109 cDNA open reading frame encoding 
the Gov 3 allele. The Gov 3 allele comprises an A at nucleotide position 2108. 

SEQ ID NO:2 consists of the entire 1445 aa protein sequence produced from CD109 Go^cDNA. 
The Gov 3 allele comprises a Tyr at amino acid 703. 

SEQ ID NO: 3 consists of the entire 4335 nucleotide CD109 cDNA open reading frame encoding 
the Gov 1 ' allele. The Gov 6 allele comprises a C at nucleotide position 2108. 

SEQ ID NO: 4 consists of the entire 1445 aa protein sequence produced from the CD109 Gov 5 
cDNA. The Gov 6 allele comprises a Ser at amino acid 703. 

SEQ ID NO: 5 consists of the CD109 genomic DNA comprising CD109 exon 19 and the flanking 
introns, introns 18 and 19. The 118 nucleotide exon 19, comprising nucleotides 952 - 1069 of 
SEQ ID NO:5, corresponds to nucleotides 2106 - 2223 of SEQ ID NO: 1. The A to C Gov 
polymorphism of CD109 (corresponding to nucleotide 2108 of SEQ ID NO: 1) therefore 
corresponds to nucleotide 954 of SEQ ID NO:5. In the Gov 3 allele, nucleotide 954 is A, while in 
the Gov 6 allele nucleotide 954 is C. Thus, SEQ ID NO:5 corresponds to the Gov 3 allele of CD109. 
Within SEQ ID NO:5, nucleotides 1 -951 correspond to CD109 intron 18, while nucleotides 1070 
- 2608 correspond to intron 19. 

We note that nucleotides 2108-2112 of SEQ ID NO: 1, and the corresponding 
nucleotides 954 - 958 of SEQ ID NO:5, which consist of the sequence 5' ACAGG 3' (and which 
contains the Gov 3 allele-specific polymorphic nucleotide at its 5' end), is not cleavable by the 
restriction endonuclease BstNI. However, in the corresponding Gov 6 allele, the corresponding 
sequence - 5' CCAGG 3' - is cleavable by BstNI, and that the two Gov alleles can be 
discriminated on this basis. We note also that a group of restriction endonucleases - Bst2UI, 
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BstNl, BstOI, EcoRII, Maelll, MspR91, Mval, orScrFI (or one of their isoschizomers) - is capable 
of differentiating between the Gov 8 and Gov 5 alleles on this basis. 

SEQ ID NO:6 - SEQ ID NO: 14 comprise oligonucleotides for the PCR amplification of Gov 
polymorphism containing CD109 sequence from RNA, cDNA derived from RNA, or from genomic 
DNA, and for the Gov typing analyses of such amplified DNA fragments. 

SEQIDNO:6. 

SEQ ID NO: 3, an antisense oligonucleotide specific for the Gov 3 allele, binds to exon 19 
sequence complementary to nucleotides 954 - 976 of SEQ ID NO:5. SEQ ID NO:6 and SEQ ID 
NO: 7 (see below) differ by a single allele-specific 3* nucleotide 

SEQ ID NO:7. 

SEQ ID NO:7, an antisense oligonucleotide specific for the Gov 6 allele, binds to exon 19 
sequence complementary to nucleotides 954 - 976 of the Gov 5 counterpart of SEQ ID NO:5. 
SEQ ID NO:6 (see above) and SEQ ID NO:7 differ by a single allele-specific 3' nucleotide. 

SEQ ID NO:8. 

SEQ ID NO:8 binds within intron 18, and corresponds to nucleotides 752 - 773 of SEQ ID NO:5. 
SEQ ID NO:9. 

SEQ ID NO:9 binds within intron 1 8 (nucleotides 875 - 892 SEQ ID NO:5). . 
SEQIDNO:10. 

SEQ ID NO:10 binds within intron 19 to the sequence complementary to nucleotides 1305 - 1322 
ofSEQIDNO:5. 

SEQ ID NO:11 

SEQ ID NO:1 1 binds within intron 18 to nucleotides 902 - 928 of SEQ ID NO:5. 
SEQ ID NO:12. 

SEQ ID NO:12, binds within exon 19 to the sequence complementary to nucleotides 977 - 1006 
ofSEQIDNO:5. 



21 



WO 02/070738 



PCT/CA02/00291 



SEQIDNO:13. 

SEQ ID NO:13, specific for the Gov 8 allele, overlaps the CD109 intron 18/exon 19 junction, and 
binds to the Gov 8 allele at nucleotides 935 - 974 of SEQ ID NO:5. 

SEQ ID NO:14. 

SEQ ID NO:14, specific for the Go^allele, overlaps the CD109 intron 18/exon 19 junction, and 
binds to the Gov^allele at the position corresponding to nucleotides 935 - 971 of SEQ ID NO:5. 
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SEQUENCE LISTING 



<110> SCHUH, Andre and OUWEHAND, Willem 



<120> Diagnosis and Treatment of Blood Disorders 



<130> 3170/0005 



<160> 30 



<170> Patentln version 3.0 



<210> 1 

<211> 4335 

<212> DNA 

<213> HUMAN 

<220> 

<221> CDS 

<222> (1)..(4335) 



<400> 1 

atg cag ggc cca ccg etc ctg acc gec gec cac etc etc tgc gtg tgc 48 
Met Gin Gly Pro Pro Leu Leu Thr Ala Ala His Leu Leu Cys Val Cys 
1 5 10 15 

acc gec gcg ctg gec gtg get ccc ggg cct egg ttt ctg gtg aca gee 96 
Thr Ala Ala Leu Ala Val Ala Pro Gly Pro Arg Phe Leu Val Thr Ala 
20 25 30 

cca ggg ate ate agg ccc gga gga aat gtg act att ggg gtg gag ctt 144 
Pro Gly He He Arg Pro Gly Gly Asn Val Thr He Gly Val Glu Leu 
35 40 45 

ctg gaa cac tgc cct tea cag gtg act gtg aag gcg gag ctg etc aag 192 
Leu Glu His Cys Pro Ser Gin Val Thr Val Lys Ala Glu Leu Leu Lys 
50 55 60 

aca gca tea aac etc act gtc tct gtc ctg gaa gca gaa gga gtc ttt 240 
Thr Ala Ser Asn Leu Thr Val Ser Val Leu Glu Ala Glu Gly Val Phe 
65 70 75 80 
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gaa aaa ggc tct ttt aag aca ctt act ctt cca tea eta cct ctg aac 288 
Glu Lys Gly Ser Phe Lys Thr Leu Thr Leu Pro Ser Leu Pro Leu Asn 
- 85 90 ' 95 

agt gca gat gag att tat gag eta cgt gta ace gga cgt acc cag gat 336 
Ser Ala Asp Glu lie Tyr Glu Leu Arg Val Thr Gly Arg Thr Gin Asp 
100 105 110 

gag att tta ttc tct aat agt acc cgc tta tea ttt gag acc aag aga 384 
Glu He Leu Phe Ser Asn Ser Thr Arg Leu Ser Phe Glu Thr Lys Arg 
115 120 125 

ata tct gtc ttc att caa aca gac aag gec tta tac aag cca aag caa 432 
He Ser Val Phe He Gin Thr Asp Lys Ala Leu Tyr Lys Pro Lys Gin 
130 135 140 

gaa gtg aag ttt cgc att gtt aca etc ttc tea gat ttt aag cct tac 480 
Glu Val Lys Phe Arg He Val Thr Leu Phe Ser Asp Phe Lys Pro Tyr 
145 ' 150 155 ~ * 160 

aaa acc tct tta aac att etc att aag gac ccc aaa tea aat ttg ate 528 
Lys Thr Ser Leu Asn He Leu He Lys Asp Pro Lys Ser Asn Leu He 
165 170 175 

caa cag tgg ttg tea caa caa agt gat ctt gga gtc att tec aaa act 576 
Gin Gin Trp Leu Ser Gin Gin Ser Asp Leu Gly Val He Ser Lys Thr 
180 185 190 

ttt cag eta tct tec cat cca ata ctt ggt gac tgg tct att caa gtt 624 
Phe Gin Leu Ser Ser His Pro He Leu Gly Asp Trp Ser He Gin Val 
195 200 205 

caa gtg aat gac cag aca tat tat caa tea ttt cag gtt tea gaa tat 672 
Gin Val Asn Asp Gin Thr Tyr Tyr Gin Ser Phe Gin Val Ser Glu Tyr 
210 215 220 

gta tta cca aaa ttt gaa gtg act ttg cag aca cca tta tat tgt tct 720 
Val Leu Pro Lys Phe Glu Val Thr Leu Gin Thr Pro Leu Tyr Cys Ser 
225 230 235 240 

atg aat tct aag cat tta aat ggt acc ate acg gca aag tat aca tat 768 
Met Asn Ser Lys His Leu Asn Gly Thr He Thr Ala Lys Tyr Thr Tyr 
245 250 255 

ggg aag cca gtg aaa gga gac gta acg ctt aca ttt tta cct tta tec 816 
Gly Lys Pro Val Lys Gly Asp Val Thr Leu Thr Phe Leu Pro Leu Ser 
260 265 270 

ttt tgg gga aag aag aaa aat att aca aaa aca ttt aag ata aat gga 864 
Phe Trp Gly Lys Lys Lys Asn He Thr Lys Thr Phe Lys He Asn Gly 
275 280 285 

tct gca aac ttc tct ttt aat gat gaa gag atg aaa aat gta atg gat 912 
Ser Ala Asn Phe Ser Phe Asn Asp Glu Glu Met Lys Asn Val Met Asp 
290 295 300 

tct tea aat gga ctt tct gaa tac ctg gat eta tct tec cct gga cca 960 
Ser Ser Asn Gly Leu Ser Glu Tyr Leu Asp Leu Ser Ser Pro Gly Pro 
305 310 ~ 315 320 

gta gaa att tta acc aca gtg aca gaa tea gtt aca ggt att tea aga 1008 
Val Glu He Leu Thr Thr Val Thr Glu Ser Val Thr Gly He Ser Arg 
325. 330 335 
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aat gta age act aat gtg ttc ttc aag caa cat gat tac ate att gag 1056 
Asn Val Ser Thr Asn Val Phe Phe Lys Gin His Asp Tyr lie lie Glu 
340 345 350 

ttt ttt gat tat act act gtc ttg aag cca tct etc aac ttc aca gec 1104 
Phe Phe Asp Tyr Thr Thr Val Leu Lys Pro Ser Leu Asn Phe Thr Ala 
355 ~ 360 365 

act gtg aag gta act cgt get gat ggc aac caa ctg act ctt gaa gaa 1152 
Thr Val Lys Val Thr Arg Ala Asp Gly Asn Gin Leu Thr Leu Glu Glu 
370 375 380 

aga aga aat aat gta gtc at a aca gtg aca cag aga aac tat act gag 1200 
Arg Arg Asn Asn Val Val lie Thr Val Thr Gin Arg Asn Tyr Thr Glu 
385 390 395 400 

tac tgg age gga tct aac agt gga aat cag aaa atg gaa get gtt cag 1248 
Tyr Trp Ser Gly Ser Asn Ser Gly Asn Gin Lys Met Glu Ala Val Gin 
405 410 415 

aaa ata aat tat act gtc ccc caa agt gga act ttt aag att gaa ttc 1296 
Lys lie Asn Tyr Thr Val Pro Gin Ser Gly Thr Phe Lys lie Glu Phe 
420 425 430 

cca ate ctg gag gat tec agt gag eta cag ttg aag gee tat ttc ctt 1344 
Pro lie Leu Glu Asp Ser Ser Glu Leu Gin Leu Lys Ala Tyr Phe Leu 
435 440 445 

ggt agt aaa agt age atg gca gtt cat agt ctg ttt aag tct cct agt 1392 
Gly Ser Lys Ser Ser Met Ala Val His Ser Leu Phe Lys Ser Pro Ser 
450 455 460 

aag aca tac ate caa eta aaa aca aga gat gaa aat ata aag gtg gga 1440 
Lys Thr Tyr lie Gin Leu Lys Thr Arg Asp Glu Asn lie Lys Val Gly 
465 470 * 475 480 

teg cct ttt gag ttg gtg gtt agt ggc aac aaa cga ttg aag gag tta 1488 
Ser Pro Phe Glu Leu Val Val Ser Gly Asn Lys Arg Leu Lys Glu Leu 
485 490 495 

age tat atg gta gta tec agg gga cag ttg gtg get gta gga aaa caa 1536 
Ser Tyr Met Val Val Ser Arg Gly Gin Leu Val Ala Val Gly Lys Gin 
500 505 510 

aat tea aca atg ttc tct tta aca cca gaa aat tct tgg act cca aaa 1584 
Asn Ser Thr Met Phe Ser Leu Thr Pro Glu Asn Ser Trp Thr Pro Lys 
515 520 525 

gee tgt gta att gtg tat tat att gaa gat gat ggg gaa att ata agt 1632 
Ala Cys Val lie Val Tyr Tyr lie Glu Asp Asp Gly Glu lie lie Ser 
530 535 540 

gat gtt eta aaa att cct gtt cag ctt gtt ttt aaa aat aag ata aag 1680 
Asp Val Leu Lys lie Pro Val Gin Leu Val Phe Lys Asn Lys lie Lys 
545 550 555 560 

eta tat tgg agt aaa gtg aaa get gaa cca tct gag aaa gtc tct ctt 1728 
Leu Tyr Trp Ser Lys Val Lys Ala Glu Pro Ser Glu Lys Val Ser Leu 
565 570 575 

agg ate tct gtg aca cag cct gac tec ata gtt ggg att gta get gtt 1776 
Arg lie Ser Val Thr Gin Pro Asp Ser lie Val Gly lie Val Ala Val 
580 585 590 
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gac aaa agt gtg aat ctg atg aat gcc tct aat gat att aca atg gaa 1824 
Asp Lys Ser Val Asn Leu Met Asn Ala Ser Asn Asp lie Thr Met Glu 
595 600 605 

aat gtg gtc cat gag ttg gaa ctt tat aac aca gga tat tat tta ggc 1872 
Asn Val Val His Glu Leu Glu Leu Tyr Asn Thr Gly Tyr Tyr Leu Gly 
610 615 620 

atg ttc atg aat tct ttt gca gtc ttt cag gaa tgt gga etc tgg gta 1920 
Met Phe Met Asn Ser Phe Ala Val Phe Gin Glu Cys Gly Leu Trp Val 
625 630 635 640 

ttg aca gat gca aac etc acg aag gat tat att gat ggt gtt tat gac 1968 
Leu Thr Asp Ala Asn Leu Thr Lys Asp Tyr lie Asp Gly Val Tyr Asp 
645 650 655 

aat gca gaa tat get gag agg ttt atg gag gaa aat gaa gga cat att 2016 
Asn Ala Glu Tyr Ala Glu Arg Phe Met Glu Glu Asn Glu Gly His lie 
660 665 670 

gta gat att cat gac ttt tct ttg ggt age agt cca cat gtc cga aag 2064 
Val Asp lie His Asp Phe Ser Leu Gly . Ser Ser Pro His Val Arg Lys 
675 680 685 

cat ttt cca gag act tgg att tgg eta gac acc aac atg ggt tac agg 2112 
His Phe Pro Glu Thr Trp lie Trp Leu Asp Thr Asn Met Gly Tyr Arg 
690 695 700 

att tac caa gaa ttt gaa gta act gta cct gat tct ate act tct tgg 2160 
lie Tyr Gin Glu Phe Glu Val Thr Val Pro Asp Ser lie Thr Ser Trp 
705 710 715 720 

gtg get act ggt ttt gtg ate tct gag gac ctg ggt ctt gga eta aca 2208 
Val Ala Thr Gly Phe Val lie Ser Glu Asp Leu Gly Leu Gly Leu Thr 
725 730 735 

act act cca gtg gag etc caa gcc ttc caa cca ttt ttc att ttt ttg 2256 
Thr Thr Pro Val Glu Leu Gin Ala Phe Gin Pro Phe Phe lie Phe Leu 
740 745 750 

aat ctt ccc tac tct gtt ate aga ggt gaa gaa ttt get ttg gaa ata 2304 
Asn Leu Pro Tyr Ser Val lie Arg Gly Glu Glu Phe Ala Leu Glu He 
755 760 765 

act ata ttc aat tat ttg aaa gat gcc act gag gtt aag gta ate att 2352 
Thr He Phe Asn Tyr Leu Lys Asp Ala Thr Glu Val Lys Val He He 
770 775 780 

gag aaa agt gac aaa ttt gat att eta atg act tea aat gaa ata aat 2400 
Glu Lys Ser Asp Lys Phe Asp He Leu Met Thr Ser Asn Glu He Asn 
785 790 795 800 

gcc aca ggc cac cag cag acc ctt ctg gtt ccc agt gag gat ggg gca 2448 
Ala Thr Gly His Gin Gin Thr Leu Leu Val Pro Ser Glu Asp Gly Ala 
805 810 815 

act gtt ctt ttt ccc ate agg cca aca cat ctg gga gaa att cct ate 2496 
Thr Val Leu Phe Pro He Arg Pro Thr His Leu Gly Glu He Pro He 
820 825 830 

aca gtc aca get ctt tea ccc act get tct gat get gtc acc cag atg 2544 
Thr Val Thr Ala Leu Ser Pro Thr Ala Ser Asp Ala Val Thr Gin Met 
835 840 845 
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att tta gta aag get gaa gga at a gaa aaa tea tat tea caa tec ate 2592 
lie Leu Val Lys Ala Glu Gly lie Glu Lys Ser Tyr Ser Gin Ser lie 
850 855 860 

tta tta gac ttg act gac aat agg eta cag agt ace ctg aaa act ttg 2640 
Leu Leu Asp Leu Thr Asp Asn Arg Leu Gin Ser Thr Leu Lys Thr Leu 
865 870 875 880 

agt ttc tea ttt cct cct aat aca gtg act ggc agt gaa aga gtt cag 2688 
Ser Phe Ser Phe Pro Pro Asn Thr Val Thr Gly Ser Glu Arg Val Gin 
885 890 895 

ate act gca att gga gat gtt ctt ggt cct tec ate aat ggc tta gee 2736 
lie Thr Ala lie Gly Asp Val Leu Gly Pro Ser lie Asn Gly Leu Ala 
900 905 910 

tea ttg att egg atg cct tat ggc tgt ggt gaa cag aac atg ata aat 2784 
Ser Leu He Arg Met Pro Tyr Gly Cys Gly Glu Gin Asn Met He Asn 
915 920 925 

ttt get cca aat att tac att ttg gat tat ctg act aaa aag aaa caa 2832 
Phe Ala Pro Asn He Tyr He Leu Asp Tyr Leu Thr Lys Lys Lys Gin 
930 935 940 

ctg aqa gat aat ttg aaa gaa aaa get ctt tea ttt atg agg caa ggt 2880 
Leu Thr Asp Asn Leu Lys Glu Lys Ala Leu Ser Phe Met Arg Gin Gly 
945 950 955 960 

tac cag aga gaa ctt etc tat cag agg gaa gat ggc tct ttc agt get 2928 
Tyr Gin Arg Glu Leu Leu Tyr Gin Arg Glu Asp Gly Ser Phe Ser Ala 
965 970 975 

ttt ggg aat tat gac cct tct ggg age act tgg ttg tea get ttt gtt 2976 
Phe Gly Asn Tyr Asp Pro Ser Gly Ser Thr Trp Leu Ser Ala Phe Val 
980 985 990 

tta aga tgt ttc ctt gaa gec gat cct tac ata gat att gat cag aat 3024 
Leu Arg Cys Phe Leu Glu Ala Asp Pro Tyr He Asp He Asp Gin Asn 
995 1000 "* 1005 

gtg tta cac aga aca tac act tgg ctt aaa gga cat cag aaa tec 3069 
Val Leu His Arg Thr Tyr Thr Trp Leu Lys Gly His Gin Lys Ser 
1010 1015 1020 

aac ggt gaa ttt tgg gat cca gga aga gtg att cat agt gag ctt 3114 
Asn Gly Glu Phe Trp Asp Pro Gly Arg Val lie His Ser Glu Leu 
1025 1030 1035 

caa ggt ggc aat aaa agt cca gta aca ctt aca gee tat att gta 3159 
Gin Gly Gly Asn Lys Ser Pro Val Thr Leu Thr Ala Tyr He Val 
1040 1045 1050 

act tct etc ctg gga tat aga aag tat cag cct aac att gat gtg 3204 
Thr Ser Leu Leu Gly Tyr Arg Lys Tyr Gin Pro Asn He Asp Val 
1055 1060 1065 

caa gag tct ate cat ttt ttg gag tct gaa ttc agt aga gga att 3249 
Gin Glu Ser He His Phe Leu Glu Ser Glu Phe Ser Arg Gly He 
1070 1075 1080 

tea gac aat tat act eta gee ctt ata act tat gca ttg tea tea 3294 
Ser Asp Asn Tyr Thr Leu Ala Leu He Thr Tyr Ala Leu Ser Ser 
1085 1090 1095 
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gtg ggg agt cct aaa gcg aag gaa get ttg aat atg ctg act tgg 
Val Gly Ser Pro Lys Ala Lye Glu Ala Leu Asn Met Leu Thr Trp 
1100 1105 1110 



3339 



aga gca gaa caa gaa ggt ggc atg caa ttc tgg gtg tea tea gag 
Arg Ala Glu Gin Glu Gly Gly Met Gin Phe Trp Val Ser Ser Glu 
1115 1120 1125 



3384 



tec aaa ctt tct gac tec tgg 
Ser Lys Leu Ser Asp Ser Trp 
1130 1135 



cag cca cgc tec ctg 
Gin Pro Arg Ser Leu 
1140 



gat att gaa 
Asp lie Glu 



3429 



gtt gca gec tat gca ctg etc tea cac ttc tta caa ttt cag act 
Val Ala Ala Tyr Ala Leu Leu Ser His Phe Leu Gin Phe Gin Thr 
1145 1150 1155 



3474 



tct gag gga ate cca att atg 
Ser Glu Gly He Pro lie Met 
1160 1165 



agg tgg eta age agg 
Arg Trp Leu Ser Arg 
1170 



caa aga aat 
Gin Arg Asn 



3519 



age ttg ggt ggt ttt gca tct act cag gat ace act gtg get tta 
Ser Leu Gly Gly Phe Ala Ser Thr Gin Asp Thr Thr Val Ala Leu 
1175 1180 1185 



3564 



aag get ctg tct gaa ttt gca 
Lys Ala Leu Ser Glu Phe Ala 
1190 1195 



gee eta atg aat aca gaa agg aca 
Ala Leu Met Asn Thr Glu Arg Thr 
1200 



3609 



aat ate caa gtg ace gtg acg ggg cct age tea cca agt cct gta 
Asn He Gin Val Thr Val Thr Gly Pro Ser Ser Pro Ser Pro Val 
1205 1210 1215 



3 654 



aag ttt ctg. att gac aca cac aac cgc tta etc ctt cag aca gca 
Lys Phe Leu lie Asp Thr His Asn Arg Leu Leu Leu Gin Thr Ala 
1220 1225 1230 



3699 



gag ctt get gtg gta cag cca 
Glu Leu Ala Val Val Gin Pro 
1235 1240 



atg gca gtt aat att 
Met Ala Val Asn He 
1245 



tec gca aat 
Ser Ala Asn 



3744 



ggt ttt gga ttt get att tgt cag etc aat gtt gta tat aat gtg 
Gly Phe Gly Phe Ala He Cys Gin Leu Asn Val Val Tyr Asn Val 
1250 1255 1260 



3789 



aag get tct ggg tct tct aga aga cga aga tct ate caa aat caa 
Lys Ala Ser Gly Ser Ser Arg Arg Arg Arg Ser He Gin Asn Gin 
1265 1270 1275 



3834 



gaa gee ttt gat tta gat gtt get gta aaa gaa aat aaa gat gat 
Glu Ala Phe Asp Leu Asp Val Ala Val Lys Glu Asn Lys Asp Asp 
1280 " 1285 1290 



3879 



etc aat cat gtg gat ttg aat gtg tgt aca age ttt teg ggc ccg 
Leu Asn His Val Asp Leu Asn Val Cys Thr Ser Phe Ser Gly Pro 
1295 1300 1305 



3924 



ggt agg agt ggc atg get ctt 
Gly Arg Ser Gly Met Ala Leu 
1310 1315 



atg gaa gtt aac eta 
Met Glu Val Asn Leu 
1320 



tta agt ggc 
Leu Ser Gly 



3969 



ttt atg gtg cct tea gaa gca att tct ctg age gag aca gtg aag 
Phe Met Val Pro Ser Glu Ala He Ser Leu Ser Glu Thr Val Lys 
1325 1330 1335 



4014 
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aaa gtg gaa tat gat cat gga aaa etc aac etc tat tta gat tct 4059 
Lys Val Glu Tyr Asp His Gly Lys Leu Asn Leu Tyr Leu Asp Ser 
1340 1345 1350 

gta aat gaa acc cag ttt tgt gtt aat att cct get gtg aga aac 4104 
Val Asn Glu Thr <3ln Phe Cys Val Asn He Pro Ala Val Arg Asn 
1355 1360 1365 

ttt aaa gtt tea aat acc caa gat get tea gtg tec at a gtg gat 4149 
Phe Lys Val Ser Asn Thr Gin Asp Ala Ser Val Ser He Val Asp 
1370 1375 1380 

tac tat gag cca agg aga cag gcg gtg aga agt tac aac tct gaa 4194 
Tyr Tyr Glu Pro Arg Arg Gin Ala Val Arg Ser Tyr Asn Ser Glu 
1385 1390 1395 

gtg aag ctg tec tec tgt gac ctt tgc agt gat gtc cag ggc tgc 4239 
Val Lys Leu Ser Ser Cys Asp Leu Cys Ser Asp Val Gin Gly Cys 
1400 1405 1410 

cgt cct tgt gag gat gga get tea ggc tec cat cat cac tct tea 4284 
Arg Pro Cys Glu Asp Gly Ala Ser Gly Ser His His His Ser Ser 
1415 1420 1425 

gtc att ttt att ttc tgt ttc aag ctt ctg tac ttt atg gaa ctt 4329 
Val He Phe He Phe Cys Phe Lys Leu Leu Tyr Phe Met Glu Leu 
1430 " 1435 1440 

tgg ctg 4335 
Trp Leu 
1445 



<210> 2 

<211> 1445 

<212> PRT 

<213> HOMAN 

<400> 2 

Met Gin Gly Pro Pro Leu Leu Thr Ala Ala His Leu Leu Cys Val Cys 
1 5 10 15 



Thr Ala Ala Leu Ala Val Ala Pro Gly Pro Arg Phe Leu Val Thr Ala 
20 25 30 



Pro Gly He He Arg Pro Gly Gly Asn Val Thr He Gly Val Glu Leu 
35 40 45 



Leu Glu His Cys Pro Ser Gin Val Thr Val Lys Ala Glu Leu Leu Lys 
50 55 60 



Thr Ala Ser Asn Leu Thr Val Ser Val Leu Glu Ala Glu Gly Val Phe 
65 70 75 80 
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Glu Lys Gly Ser Phe Lys Thr Leu Thr Leu Pro Ser Leu Pro Leu Asn 
85 90 95 



Ser Ala Asp Glu lie Tyr Glu Leu Arg Val Thr Gly Arg Thr Gin Asp 
100 105 110 



Glu He Leu Phe Ser Asn Ser Thr Arg Leu Ser Phe Glu Thr Lys Arg 
115 120 125 



He Ser Val Phe He Gin Thr Asp Lys Ala Leu Tyr Lys Pro Lys Gin 
130 135 140 



Glu Val Lys Phe Arg He Val Thr Leu Phe Ser Asp Phe Lys Pro Tyr 
145 150 155 160 



Lys Thr Ser Leu Asn He Leu He Lys Asp Pro Lys Ser Asn Leu He 
165 170 " 175 



Gin Gin Trp Leu Ser Gin Gin Ser Asp Leu Gly Val He Ser Lys Thr 
180 185 190 



Phe Gin Leu Ser Ser His Pro He Leu Gly Asp Trp Ser He Gin Val 
195 200 205 



Gin Val Asn Asp Gin Thr Tyr Tyr Gin Ser Phe Gin Val Ser Glu Tyr 
210 215 220 



Val Leu Pro Lys Phe Glu Val Thr Leu Gin Thr Pro Leu Tyr Cys Ser 
225 ~ 230 235' 240 



Met Asn Ser Lys His Leu Asn Gly Thr He Thr Ala Lys Tyr Thr Tyr 
245 250 255 



Gly Lys Pro Val Lys Gly Asp Val Thr Leu Thr Phe Leu Pro Leu Ser 
260 265 270 



Phe Trp Gly Lys Lys Lys Asn He Thr Lys Thr Phe Lys He Asn Gly 
275 280 285 



Ser Ala Asn Phe Ser Phe Asn Asp Glu Glu Met Lys Asn Val Met Asp 
290 295 300 



Ser Ser Asn Gly Leu Ser Glu Tyr Leu Asp Leu Ser Ser Pro Gly Pro 
305 310 315 ' 320 



Val Glu He Leu Thr Thr Val Thr Glu Ser Val Thr Gly He Ser Arg 
325 330 335 
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Asn Val Ser Thr Asn Val Phe Phe Lys Gin His Asp Tyr He lie Glu 
340 345 350 



Phe Phe Asp Tyr Thr Thr Val Leu Lys Pro Ser Leu Asn Phe Thr Ala 
355 360 365 



Thr Val Lys Val Thr Arg Ala Asp Gly Asn Gin Leu Thr Leu Glu* Glu 
370 375 380 



Arg Arg Asn Asn Val Val He Thr Val Thr Gin Arg Asn Tyr Thr Glu 
385 390 395 400 



Tyr Trp Ser Gly Ser Asn Ser Gly Asn Gin Lys Met Glu Ala Val Gin 
405 410 415 



Lys He Asn Tyr Thr Val Pro Gin Ser Gly Thr Phe Lys He Glu Phe 
420 425 430 



Pro He Leu Glu Asp Ser Ser Glu Leu Gin Leu Lys Ala Tyr Phe Leu 
435 440 445 



Gly Ser Lys Ser Ser Met Ala Val His Ser Leu Phe Lys Ser Pro Ser 
450 455 460 



Lys Thr Tyr He Gin Leu Lys Thr Arg Asp Glu Asn He Lys Val Gly 
465 470 475 480 



Ser Pro Phe Glu Leu Val Val Ser Gly Asn Lys Arg Leu Lys Glu Leu 
485 490 495 



Ser Tyr Met Val Val Ser Arg Gly Gin Leu Val Ala Val Gly Lys Gin 
500 505 510 



Asn Ser Thr* Met Phe Ser Leu Thr Pro Glu Asn Ser Trp Thr Pro Lys 
515 * 520 525 



Ala Cys val He Val Tyr Tyr lie Glu Asp Asp Gly Glu He He Ser 
.530 535 540 



Asp Val Leu Lys He Pro Val Gin Leu Val Phe Lys Asn Lys He Lys 
545 550 555 560 



Leu Tyr Trp Ser Lys Val Lys Ala Glu Pro Ser Glu Lys Val Ser Leu 
565 570 575 



Arg He Ser Val Thr Gin Pro Asp Ser He Val Gly He Val Ala Val 
580 585 590 
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Asp Lys Ser Val Asn Leu Met Asn Ala Ser Asn Asp lie Thr Met Glu 
595 600 605 



Asn Val Val His Glu Leu Glu Leu Tyr Asn Thr Gly Tyr Tyr Leu Gly 
610 615 620 



Met Phe Met Asn Ser Phe Ala Val Phe Gin Glu Cys Gly Leu Trp Val 
625 630 635 640 



Leu Thr Asp Ala Asn Leu Thr Lys Asp Tyr He Asp Gly Val Tyr Asp 
645 650 655 



Asn Ala Glu Tyr Ala Glu Arg Phe Met Glu Glu Asn Glu Gly His He 
660 665 670 



Val Asp He His Asp Phe Ser Leu Gly Ser Ser Pro His Val Arg Lys 
675 680 685 



His Phe Pro Glu Thr Trp He Trp Leu Asp Thr Asn Met Gly Tyr Arg 
690 695 700 



He Tyr Gin Glu Phe Glu Val Thr Val Pro Asp Ser He Thr Ser Trp 
705 710 715 720 



Val Ala Thr Gly Phe Val He Ser Glu Asp Leu Gly Leu Gly Leu Thr 
725 730 735 



Thr Thr Pro Val Glu Leu Gin Ala Phe Gin Pro Phe Phe He Phe Leu 
740 745 750 



Asn Leu Pro Tyr Ser Val He Arg Gly Glu Glu Phe Ala Leu Glu He 
755 760 765 



Thr He Phe Asn Tyr Leu Lys Asp Ala Thr Glu Val Lys Val life He 
770 775 780 



Glu Lys Ser Asp Lys Phe Asp He Leu Met Thr Ser Asn Glu He Asn 
785 790 795 800 



Ala Thr Gly His Gin Gin Thr Leu Leu Val Pro Ser Glu Asp Gly Ala 
805 810 815 



Thr Val Leu Phe Pro He Arg Pro Thr His Leu Gly Glu He Pro He 
820 825 830 



Thr Val Thr Ala Leu Ser Pro Thr Ala Ser Asp Ala Val Thr Gin Met 
835 840 ~ 845 
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lie Leu Val Lys Ala Glu Gly He Glu Lys Ser Tyr Ser Gin Ser He 
850 855 860 



Leu Leu Asp Leu Thr Asp Asn Arg Leu Gin Ser Thr Leu Lys "Thr Leu 
865 870 875 880 



Ser Phe Ser Phe Pro Pro Asn Thr Val Thr Gly Ser Glu Arg Val Gin 
885 890 895 



He Thr Ala He Gly Asp Val Leu Gly Pro Ser He Asn Gly Leu Ala 
900 905 910 



Ser Leu He Arg Met Pro Tyr Gly Cys Gly Glu Gin Asn Met He Asn 
915 920 925 



Phe Ala Pro Asn He Tyr He Leu Asp Tyr Leu Thr Lys Lys Lys Gin 
930 935 940 



Leu Thr Asp Asn Leu Lys Glu Lys Ala Leu Ser Phe Met Arg Gin Gly 
945 950 955 960 



Tyr Gin Arg Glu Leu Leu Tyr Gin Arg Glu Asp Gly Ser Phe Ser Ala 
965 970 975 



Phe Gly Asn Tyr Asp Pro Ser Gly Ser Thr Trp Leu Ser Ala Phe Val 
980 985 990 



Leu Arg Cys Phe Leu Glu Ala Asp Pro Tyr He Asp He Asp Gin Asn 
995 1000 1005 



Val Leu His Arg Thr Tyr Thr Trp Leu Lys Gly His Gin Lys Ser 
1010 1015 1020 



Asn Gly Glu Phe Trp Asp Pro Gly Arg Val He His Ser Glu Leu 
1025 1030 1035 



Gin Gly Gly Asn Lys Ser Pro Val Thr Leu Thr Ala Tyr He Val 
1040 1045 1050 



Thr Ser Leu Leu Gly Tyr Arg Lys Tyr Gin Pro Asn He Asp Val 
1055 1060 1065 



Gin Glu Ser He His Phe Leu Glu Ser Glu Phe Ser Arg Gly He 
1070 1075 1080 



Ser Asp Asn Tyr Thr Leu Ala Leu He Thr Tyr Ala Leu Ser Ser 
1085 1090 1095 
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Val Gly Ser Pro Lys Ala Lys Glu Ala Leu Asn Met Leu Thr Trp 
1100 1105 1110 



Arg Ala Glu Gin Glu Gly Gly Met Gin Phe Trp Val Ser Ser Glu 
1115 1120 1125 



Ser Lys Leu Ser Asp Ser Trp Gin Pro Arg Ser Leu Asp lie Glu 
1130 1135 1140 



Val Ala Ala Tyr Ala Leu Leu Ser His Phe Leu Gin Phe Gin Thr 
1145 1150 1155 



Ser Glu Gly lie Pro lie Met Arg Trp Leu Ser Arg Gin Arg Asn 
1160 1165 1170 



Ser Leu Gly Gly Phe Ala Ser Thr Gin Asp Thr Thr Val Ala Leu 
1175 1180 1185 



Lys Ala Leu Ser Glu Phe Ala Ala Leu Met Asn Thr Glu Arg Thr 
1190 1195 1200 



Asn lie Gin Val Thr Val Thr Gly Pro Ser Ser Pro Ser Pro Val 
1205 1210 . 1215 



Lys Phe Leu lie Asp Thr His Asn Arg Leu Leu Leu Gin Thr Ala 
1220 1225 1230 



Glu Leu Ala Val Val Gin Pro Met Ala Val Asn He Ser Ala Asn 
1235 1240 1245 



Gly Phe Gly Phe Ala He Cys Gin Leu Asn Val Val Tyr Asn Val 
1250 1255 1260 



Lys Ala Ser Gly Ser Ser Arg Arg Arg Arg Ser He Gin Asn Gin 
1265 1270 1275 



Glu Ala Phe Asp Leu Asp Val Ala Val Lys Glu Asn Lys Asp Asp 
1280 1285 1290 



Leu Asn His Val Asp Leu Asn Val Cys Thr Ser Phe Ser Gly Pro 
1295 1300 1305 



Gly Arg Ser Gly Met Ala Leu Met Glu Val Asn Leu Leu Ser Gly 
1310 1315 1320 



Phe Met Val Pro Ser Glu Ala He Ser Leu Ser Glu Thr Val Lys 
1325 1330 1335 
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Lys Val Glu Tyr Asp His Gly Lys Leu Asn Leu Tyr Leu Asp Ser 
1340 ' 1345 1350 



Val Asn Glu Thr Gin Phe Cys Val Asn lie Pro Ala Val Arg Asn 
1355 1360 1365 



Phe Lys Val Ser Asn Thr Gin Asp Ala Ser Val Ser lie Val Asp 
1370 1375 1380 



Tyr Tyr Glu Pro Arg Arg Gin Ala Val Arg Ser Tyr Asn Ser Glu 
1385 1390 1395 



Val Lys Leu Ser Ser Cys Asp Leu Cys Ser Asp Val Gin Gly Cys 
1400 1405 1410 . 



Arg Pro Cys Glu Asp Gly Ala Ser Gly Ser His His His Ser Ser 
1415 1420 1425 



Val lie Phe lie Phe Cys Phe Lys Leu Leu Tyr Phe Met Glu Leu 
1430 ^ 1435 1440 



Trp Leu 
1445 



<210> 3 

<211> 4335 

<212> DNA 

<213> HUMAN 



<220> 

<221> CDS 

<222> (1) . . (4335) 

<220> 

<221> CDS 

<222> (1) . . (4335) 

<400> 3 

atg cag ggc cca ccg etc ctg acc gec gec cac etc etc tgc gtg tgc 48 
Met Gin Gly Pro Pro Leu Leu Thr Ala Ala His Leu Leu Cys Val Cys 
15 10 15 
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acc gcc gcg ctg gcc gtg get ccc ggg cct egg ttt ctg gtg aca gec 96 
Thr Ala Ala Leu Ala Val Ala Pro Gly Pro Arg Phe Leu Val Thr Ala 
20 25 30 

cca ggg ate ate agg ccc gga gga aat gtg act att ggg gtg gag ctt 144 
Pro Gly lie He Arg Pro Gly Gly Asn Val Thr He Gly Val Glu Leu 
35 40 45 

ctg gaa cac tgc cct tea cag gtg act gtg aag gcg gag ctg etc aag 192 
Leu Glu His Cys Pro Ser Gin Val Thr Val Lys Ala Glu Leu Leu Lys 
50 55 60 

aca gca tea aac etc act gtc tct gtc ctg gaa gca gaa gga gtc ttt 240 
Thr Ala Ser Asn Leu Thr Val Ser Val Leu Glu Ala Glu Gly Val Phe 
65 70 75 80 

gaa aaa ggc tct ttt aag aca ctt act ctt cca tea eta cct ctg aac 288 
Glu Lys Gly Ser Phe Lys Thr Leu Thr Leu Pro Ser Leu Pro Leu Asn 
85 90 95 

agt gca gat gag att tat gag eta cgt gta acc gga cgt acc cag gat 336 
Ser Ala Asp Glu He Tyr Glu Leu Arg Val Thr Gly Arg Thr Gin Asp 
100 105 110 

gag att tta ttc tct aat agt acc cgc tta tea ttt gag acc aag aga 384 
Glu He Leu Phe Ser Asn Ser Thr Arg Leu Ser Phe Glu Thr Lys Arg 
115 120 125 

ata tct gtc ttc att caa aca gac aag gcc tta tac aag cca aag caa 432 
He Ser Val Phe He Gin Thr Asp Lys Ala Leu Tyr Lys Pro Lys Gin 
130 135 140 

gaa gtg aag ttt cgc att gtt aca etc ttc tea gat ttt aag cct tac 480 
Glu Val Lys Phe Arg lie Val Thr Leu Phe Ser Asp Phe Lys Pro Tyr 
145 150 155 160 

aaa acc tct tta aac att etc att aag gac ccc aaa tea aat ttg ate 528 
Lys Thr Ser Leu Asn He Leu He Lys Asp Pro Lys Ser Asn Leu lie 
165 170 175 

caa cag tgg ttg tea caa caa agt gat ctt gga gtc att tec aaa act 576 
Gin Gin Trp Leu Ser Gin Gin Ser Asp Leu Gly Val He Ser Lys Thr 
180 185 190 

ttt cag eta tct tec cat cca ata ctt ggt gac tgg tct att caa gtt 624 
Phe Gin Leu Ser Ser His Pro He Leu Gly Asp Trp Ser He Gin Val 
195 200 205 

caa gtg aat gac cag aca tat tat caa tea ttt cag gtt tea gaa tat 672 
Gin Val Asn Asp Gin Thr Tyr Tyr Gin Ser Phe Gin Val Ser Glu Tyr 
210 215 220 

gta tta cca aaa ttt gaa gtg act ttg cag aca cca tta tat tgt tct 720 
Val Leu Pro Lys Phe Glu Val Thr Leu Gin Thr Pro Leu Tyr Cys Ser 
225 230 235 240 

atg aat tct aag cat tta aat ggt acc ate acg gca aag tat aca tat 768 
Met Asn Ser Lys His Leu Asn Gly Thr He Thr Ala Lys Tyr Thr Tyr 
245 250 255 

ggg aag cca gtg aaa gga gac gta acg ctt aca ttt tta cct tta tec 816 
Gly Lys Pro Val Lys Gly Asp Val Thr Leu Thr Phe Leu Pro Leu Ser 
260 265 270 
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ttt tgg gga aag aag aaa aat att aca aaa aca ttt aag ata aat gga 
Phe Trp Gly Lys Lys Lys Asn He Thr Lys Thr Phe Lys He Asn Gly 
275 * 280 285 



864 



tct gca aac ttc tct ttt aat gat gaa gag atg aaa aat gta atg gat 912 
Ser Ala Asn Phe Ser Phe Asn Asp Glu Glu Met Lys Asn Val Met Asp 
290 295 300 

tct tea aat gga ctt tct gaa tac ctg gat eta tct tec cct gga cca 960 
Ser Ser Asn Gly Leu Ser Glu Tyx Leu Asp Leu Ser Ser Pro Gly Pro 
305 310 315 320 

gta gaa att tta ace aca gtg aca gaa tea gtt aca ggt att tea aga 1008 
Val Glu He Leu Thr Thr Val Thr Glu Ser Val Thr Gly He Ser Arg 
325 330 335 

aat gta age act aat gtg ttc ttc aag caa cat gat tac ate att gag 1056 
Asn Val Ser Thr Asn Val Phe Phe Lys Gin His Asp Tyr He He Glu 
340 345 350 

ttt ttt gat tat act act gtc ttg aag cca tct etc aac ttc aca gee 1104 
Phe Phe Asp Tyr Thr Thr Val Leu Lys Pro Ser Leu Asn Phe Thr Ala 
355 360 365 

act gtg aag gta act cgt get gat ggc aac caa ctg act ctt gaa gaa 1152 
Thr Val Lys Val Thr Arg Ala Asp Gly Asn Gin Leu Thr Leu Glu Glu 
370 375 380 

aga aga aat aat gta gtc ata aca gtg aca cag aga aac tat act gag 1200 
Arg Arg Asn Asn Val Val He Thr Val Thr Gin Arg Asn Tyr Thr Glu 
385 390 395 400 

tac tgg age gga tct aac agt gga aat cag aaa atg gaa get gtt cag 1248 
Tyr Trp Ser Gly Ser Asn Ser Gly Asn Gin Lys Met Glu Ala Val Gin 
405 410 415 

aaa ata aat tat act gtc ccc caa agt gga act ttt aag att gaa ttc 1296 
Lys He Asn Tyr Thr Val Pro Gin Ser Gly Thr Phe Lys He Glu Phe 
420 425 430 

cca ate ctg gag gat tec agt gag eta cag ttg aag gee tat ttc ctt 1344 
Pro He Leu Glu Asp Ser Ser Glu Leu Gin Leu Lys Ala Tyr Phe Leu 
435 440 445 

ggt agt aaa agt age atg gca gtt cat agt ctg ttt aag tct cct agt 1392 
Gly Ser Lys Ser Ser Met Ala Val His Ser Leu Phe Lys Ser Pro Ser 
450 455 460 

aag aca tac ate caa eta aaa aca aga gat gaa aat ata aag gtg gga 1440 
Lys Thr Tyr He Gin Leu Lys Thr Arg Asp Glu Asn He Lys Val Gly 
465 470 475 480 

teg cct ttt gag ttg gtg gtt agt ggc aac aaa cga ttg aag gag tta 1488 
Ser Pro Phe Glu Leu Val Val Ser Gly Asn Lys Arg Leu Lys Glu Leu 
485 490 495 

age tat atg gta gta tec agg gga cag ttg gtg get gta gga aaa caa 1536 
Ser Tyr Met Val Val Ser Arg Gly Gin Leu Val Ala Val Gly Lys Gin 
500 505 510 

aat tea aca atg ttc tct tta aca cca gaa aat tct tgg act cca aaa 1584 
Asn Ser Thr Met Phe Ser Leu Thr Pro Glu Asn Ser Trp Thr Pro Lys 
515 520 525 
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gcc tgt gta att gtg tat tat att gaa gat gat ggg gaa att ata agt 1632 
Ala Cys Val lie Val Tyr Tyr lie Glu Asp Asp Gly Glu lie lie Ser 
530 535 540 

gat gtt eta aaa att cct gtt cag ctt gtt ttt aaa aat aag ata aag 1680 
Asp Val Leu Lys lie Pro Val Gin Leu Val Phe Lys Asu Lys lie Lys 
545 550 555 560 

eta tat tgg agt aaa gtg aaa get gaa cca tct gag aaa gtc tct ctt 1728 
Leu Tyr Trp Ser Lys Val Lys Ala Glu Pro Ser Glu Lys Val Ser Leu 
565 570 575 

agg ate tct gtg aca cag cct gac tec ata gtt ggg att gta get gtt 1776 
Arg He Ser Val Thr Gin Pro Asp Ser He Val Gly He Val Ala Val 
580 585 590 

gac aaa agt gtg aat ctg atg aat gcc tct aat gat att aca atg gaa 1824 
Asp Lys Ser Val Asn Leu Met Asn Ala Ser Asn Asp He Thr Met Glu 
595 600 605 

aat gtg gtc cat gag ttg gaa ctt tat aac aca gga tat tat tta ggc 1872 
Asn Val Val His Glu Leu Glu Leu Tyr Asn Thr Gly Tyr Tyr Leu Gly 
610 615 620 

atg ttc atg aat tct ttt gca gtc ttt cag gaa tgt gga etc tgg gta 1920 
Met Phe Met Asn Ser Phe Ala Val Phe Gin Glu Cys Gly Leu Trp Val 
625 630 635 640 

ttg aca gat gca aac etc acg aag gat tat att gat ggt gtt tat gac 1968 
Leu Thr Asp Ala Asn Leu Thr Lys Asp Tyr He Asp Gly Val Tyr Asp 
645 650 655 

aat gca gaa tat get gag agg ttt atg gag gaa aat gaa gga cat att 2016 
Asn Ala Glu Tyr Ala Glu Arg Phe Met Glu Glu Asn Glu Gly His lie. 
660 665 670 

gta gat att cat gac ttt tct ttg ggt age agt cca cat gtc cga aag 2064 
Val Asp He His Asp Phe Ser Leu Gly Ser Ser Pro His Val Arg Lys . 
675 680 685 

cat ttt cca gag act tgg att tgg eta gac ace aac atg ggt tec agg 2112 
His Phe Pro Glu Thr Trp He Trp Leu Asp Thr Asn Met Gly Ser Arg 
690 695 700 

att tac caa gaa ttt gaa gta act gta cct gat tct ate act tct tgg 2160 
He Tyr Gin Glu Phe Glu Val Thr Val Pro Asp Ser He Thr Ser Trp 
705 710 715 720 

gtg get act ggt ttt gtg ate tct gag gac ctg ggt ctt gga eta aca * 2208 
Val Alci Thr Gly Phe Val He Ser Glu Asp Leu Gly Leu Gly Leu Thr 
725 730 735 

act act cca gtg gag etc caa gcc ttc caa cca ttt ttc att ttt ttg 2256 
Thr Thr Pro Val Glu Leu Gin Ala Phe Gin Pro Phe Phe He Phe Leu 
740 745 750 

aat ctt ccc tac tct gtt ate aga ggt gaa gaa ttt get ttg gaa ata 2304 
Asn Leu Pro Tyr Ser Val He Arg Gly Glu Glu Phe Ala Leu Glu He 
755 760 765 

act ata ttc aat tat ttg aaa gat gcc act gag gtt aag gta ate att 2352 
Thr He Phe Asn Tyr Leu Lys Asp Ala Thr Glu Val Lys Val He He 
770 775 780 
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gag aaa agt gac aaa ttt gat att eta atg act tea aat gaa ata aat 2400 
Glu Iiys Ser Asp Lys Phe Asp lie Leu Met Thr Ser Asn Glu lie Asn 
785 790 795 800 

gec aca ggc cac cag cag acc ctt ctg gtt ccc agt gag gat ggg gca 2448 
Ala Thr Gly His Gin Gin Thr Leu Leu Val Pro Ser Glu Asp Gly Ala 
805 810 815 

act gtt ctt ttt ccc ate agg cca aca cat ctg gga gaa att cct ate 2496 
Thr Val Leu Phe Pro He Arg Pro Thr His Leu Gly Glu He Pro He 
820 825 830 

aca gtc aca get ctt tea ccc act get tct gat get gtc acc cag atg 2544 
Thr Val Thr Ala Leu Ser Pro Thr Ala Ser Asp Ala Val Thr Gin Met 
835 840 845 

att tta gta aag get gaa gga ata gaa aaa tea tat tea caa tec ate 2592 
He Leu Val Lys Ala Glu Gly He Glu Lys Ser Tyr Ser Gin Ser He 
850 855 860 

tta tta gac ttg act gac aat agg eta cag agt acc ctg aaa act ttg 2640 
Leu Leu Asp Leu Thr Asp Asn Arg Leu Gin Ser Thr Leu Lys Thr Leu 
865 870 875 880 

agt ttc tea ttt cct cct aat aca gtg act ggc agt gaa aga gtt cag 2688 
Ser Phe Ser Phe Pro Pro Asn Thr Val Thr Gly Ser Glu Arg Val Gin 
885 890 895 

ate act gca att gga gat gtt ctt ggt cct tec ate aat ggc tta gee 2736 
He Thr Ala He Gly Asp Val Leu Gly Pro Ser He Asn Gly Leu Ala 
900 905 910 

» 

tea ttg att egg atg cct tat ggc tgt ggt gaa cag aac atg ata aat 2784 
Ser Leu He Arg Met Pro Tyr Gly Cys Gly Glu Gin Asn Met He Asn 
915 920 925 

ttt get cca aat att tac att ttg gat tat ctg act aaa aag aaa caa 2832 
Phe Ala Pro Asn He Tyr He Leu Asp Tyr Leu Thr Lys Lys Lys Gin 
930 935 940 

ctg aca gat aat ttg aaa gaa aaa get ctt tea ttt atg agg caa ggt 2880 
Leu Thr Asp Asn Leu Lys Glu Lys Ala Leu Ser Phe Met Arg Gin Gly 
945 950 955 960 

tac cag aga gaa ctt etc tat cag agg gaa gat ggc tct ttc agt get 2928 
Tyr Gin Arg Glu Leu Leu Tyr Gin Arg Glu Asp Gly Ser Phe Ser Ala 
965 970 975 

ttt ggg aat tat gac cct tct ggg age act tgg ttg tea get ttt gtt 2976 
Phe Gly Asn Tyr Asp Pro Ser Gly Ser Thr Trp Leu Ser Ala Phe Val 
980 985 990 

tta aga tgt ttc ctt gaa gee gat cct tac ata gat att gat cag aat 3024 
Leu Arg Cys Phe Leu Glu Ala Asp Pro Tyr He Asp He Asp Gin Asn 
995 1000 1005 

gtg tta cac aga aca tac act tgg ctt aaa gga cat cag aaa tec 3069 
Val Leu His Arg Thr Tyr Thr Trp Leu Lys Gly His Gin Lys Ser 
1010 1015 1020 

aac ggt gaa ttt tgg gat cca gga aga gtg att cat agt gag ctt 3114 
Asn Gly Glu Phe Trp Asp Pro Gly Arg Val He His Ser Glu Leu 
1025 1030 1035 
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caa ggt ggc aat aaa agt cca gta aca ctt aca gcc tat att gta 
Gin Gly Gly Asn Lys Ser Pro Val Thr Leu Thr Ala Tyr He Val 
1040 1045 1050 



3159 



act tct etc ctg gga tat aga 
Thr Ser Leu Leu Gly Tyr Arg 
1055 1060 

caa gag tct ate cat ttt ttg 
Gin Glu Ser He His Phe Leu 
1070 1075 



aag tat cag cct aac 
Lys Tyr Gin Pro Asn 
1065 



att gat gtg 
He Asp Val 



gag tct gaa ttc agt aga gga att 
Glu Ser Glu Phe Ser Arg Gly He 
1080 



3204 



324J 



tea gac aat tat act eta gcc ctt ata act tat gca ttg tea tea 
Ser Asp Asn Tyr Thr Leu Ala Leu He Thr Tyr Ala Leu Ser Ser 
1085 1090 1095 



3294 



gtg ggg agt cct aaa gcg aag 
Val Gly Ser Pro Lys Ala Lys 
1100 1105 



gaa get ttg aat atg 
Glu Ala Leu Asn Met 
1110 



ctg act tgg 
Leu Thr Trp 



3339 



aga gca gaa caa gaa ggt ggc atg caa ttc tgg gtg tea tea gag 
Arg Ala Glu Gin Glu Gly Gly Met Gin Phe Trp Val Ser Ser Glu 
1115 1120 1125 



3384 



tec aaa ctt tct gac tec tgg 
Ser Lys Leu Ser Asp Ser Trp 
1130 1135 



cag cca cgc tec ctg 
Gin Pro Arg Ser Leu 
1140 



gat att gaa 
Asp He Glu 



3429 



gtt gca gcc tat gca ctg etc 
Val Ala Ala Tyr Ala Leu Leu 
1145 1150 



tea cac ttc tta caa 
Ser His Phe Leu Gin 
1155 



ttt cag act 
Phe Gin Thr 



3474 



tct gag gga ate cca att atg agg tgg eta age agg caa aga aat 
Ser Glu Gly He Pro He Met Arg Trp Leu Ser Arg Gin Arg Asn 
1160 1165 1170 



3519 



age ttg ggt ggt ttt gca tct act cag gat acc act gtg get tta 
Ser Leu Gly Gly Phe Ala Ser Thr Gin Asp Thr Thr Val Ala Leu 
1175 1180 1185 



3564 



aag get ctg tct gaa ttt gca 
Lys Ala Leu Ser Glu Phe Ala 
1190 1195 



gcc eta atg aat aca gaa agg aca 
Ala Leu Met Asn Thr Glu Arg Thr 
1200 



3609 



aat ate caa gtg acc gtg acg ggg cct age tea cca agt cct gta 
Asn He Gin Val Thr Val Thr Gly Pro Ser Ser Pro Ser Pro Val 
1205 1210 1215 



3654 



aag ttt ctg att gac aca cac aac cgc tta etc ctt cag aca gca 
Lys Phe Leu He Asp Thr His Asn Arg Leu Leu Leu Gin Thr Ala 
1220 1225 1230 



3699 



gag ctt get gtg gta cag cca atg gca gtt aat att tec gca aat 
Glu Leu Ala Val Val Gin Pro Met Ala Val Asn He Ser Ala Asn 
1235 1240 1245 



3744 



ggt ttt gga ttt get att tgt cag etc aat gtt gta tat aat gtg 
Gly Phe Gly Phe. Ala He Cys Gin Leu Asn Val Val Tyr Asn Val 
1250 1255 1260 



3789 



aag get tct ggg tct tct aga aga cga aga tct ate caa aat caa 
Lys Ala Ser Gly Ser Ser Arg Arg Arg Arg Ser He Gin Asn Gin 
1265 1270 1275 



3834 
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gaa gcc ttt gat tta gat gtt 
Glu Ala Phe Asp Leu Asp Val 
1280 1285 



get gta aaa gaa aat 
Ala Val Lys Glu Asn 
1290 



etc aat cat gtg gat ttg aat gtg tgt aca age ttt 
Leu Asn His Val Asp Leu Asn Val Cys Thr Ser Phe 
1295 1300 1305 



aaa gat gat 
Lys Asp Asp 



teg ggc ccg 
Ser Gly Pro 



3879 



3924 



ggt agg agt ggc atg get ctt 
Gly Arg Ser Gly Met Ala Leu 
1310 1315 



atg gaa gtt aac eta 
Met Glu Val Asn Leu 
1320 



tta agt ggc 
Leu Ser Gly 



3969 



ttt atg gtg cct tea gaa gca 
Phe Met Val' Pro Ser Glu Ala 
1325 1330 



att tct ctg age gag 
lie Ser Leu Ser Glu 
1335 



aca gtg aag 
Thr Val Lys 



4014 



aaa gtg gaa tat gat cat gga 
Lys Val Glu Tyr Asp His Gly 
1340 1345 



aaa etc aac etc tat 
Lys Leu Asn Leu Tyr 
1350 



tta gat tct 
Leu Asp Ser 



4059 



gta aat gaa acc cag ttt tgt 
Val Asn Glu Thr Gin Phe Cys 
1355 1360 



gtt aat att cct get 
Val Asn He Pro Ala 
1365 



gtg aga aac 
Val Arg Asn 



4104 



ttt aaa gtt tea aat acc caa 
Phe Lys Val Ser Asn Thr Gin 
1370 1375 



gat get tea gtg tec ata gtg gat 
Asp Ala Ser Val Ser He Val Asp 
1380 



4149 



tac tat gag cca agg aga cag gcg gtg aga agt tac 
Tyr Tyr Glu Pro Arg Arg Gin Ala Val Arg Ser Tyr 
1385 1390 1395 

gtg aag ctg tec tec tgt gac ctt tgc agt gat gtc 
Val Lys Leu Ser Ser Cys Asp Leu Cys Ser Asp Val 
1400 1405 1410 

cgt cct tgt gag gat gga get tea ggc tec cat cat 
Arg Pro Cys Glu Asp Gly Ala Ser Gly Ser His His 
1415 1420 1425 



aac tct gaa 
Asn Ser Glu 



cag ggc tgc 
Gin Gly Cys 



cac tct tea 
His Ser Ser 



4194 



4239 



4284 



gtc att ttt att ttc tgt ttc 
Val He Phe He Phe Cys Phe 
1430 1435 



aag ctt ctg tac ttt 
Lys Leu Leu Tyr Phe 
1440 



atg gaa ctt 
Met Glu Leu 



4329 



tgg ctg 
Trp Leu 
1445 



4335 



<210> 4 

<211> 1445 

<212> PRT 

<213> HUMAN 



<400> 4 

Met Gin Gly Pro Pro Leu Leu Thr Ala Ala His Leu Leu Cys Val Cys 
15 10 15 
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Thr Ala Ala Leu Ala Val Ala Pro Gly Pro Arg Phe Leu Val Thr Ala 
20 25 30 

Pro Gly lie He Arg Pro Gly Gly Asn Val Thr He Gly Val Glu Leu 
35 ~ 40 . 45 

Leu Glu His Cys Pro Ser Gin Val Thr Val Lys Ala Glu Leu Leu Lys 
50 55 60 



Thr Ala Ser Asn Leu Thr Val Ser Val Leu Glu Ala Glu Gly Val Phe 
65 70 75 80 



Glu Lys Gly Ser Phe Lys Thr Leu Thr Leu Pro Ser Leu Pro Leu Asn 
85 90 95 



Ser Ala Asp Glu He Tyr Glu Leu Arg Val Thr Gly Arg Thr Gin Asp 
100 105 110 



Glu He Leu Phe Ser Asn Ser Thr Arg Leu Ser Phe Glu Thr Lys Arg 
115 120 125 



He Ser Val Phe He Gin Thr Asp Lys Ala Leu Tyr Lys Pro Lys Gin 
130 135 140 



Glu Val Lys Phe Arg He Val Thr Leu Phe Ser Asp Phe Lys Pro Tyr 
145 150 155 1&0 



Lys Thr Ser Leu Asn lie Leu He Lys Asp Pro Lys Ser Asn Leu He 
165 170 175 



Gin Gin Trp Leu Ser Gin Gin Ser Asp Leu Gly Val He Ser Lys Thr 
180 185 190 



Phe Gin Leu Ser Ser His Pro He Leu Gly Asp Trp Ser He Gin Val 
195 200 205 



Gin Val Asn Asp Gin Thr Tyr Tjnc Gin Ser Phe Gin Val Ser Glu Tyr 
210 215 220 



Val Leu Pro Lys Phe Glu Val Thr Leu Gin Thr Pro Leu Tyr Cys Ser 
225 230 235 240 



Met Asn Ser Lys His Leu Asn Gly Thr He Thr Ala Lys Tyr Thr Tyr 
245 250 255 



Gly Lys Pro Val Lys Gly Asp Val Thr Leu Thr Phe Leu Pro Leu Ser 
260 " ^ " 265 270 
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Phe Trp Gly Lys Lys Lys Asn lie Thr Lys Thr Phe Lys lie Asn Gly 
275 280 285 



Ser Ala Asn Phe Ser Phe Asn Asp Glu Glu Met Lys Asn Val Met Asp 
290 295 300 



Ser Ser Asn Gly Leu Ser Glu Tyr Leu Asp Leu Ser Ser Pro Gly Pro 
305 310 315 320 



Val Glu lie Leu Thr Thr Val Thr Glu Ser Val Thr Gly lie Ser Arg 
325 330 335 



Asn Val Ser Thr Asn Val Phe Phe Lys Gin His Asp Tyr lie lie Glu 
340 345 350 



Phe Phe Asp Tyr Thr Thr Val Leu Lys Pro Ser Leu Asn Phe Thr Ala 
355 360 365 



Thr Val Lys Val Thr Arg Ala Asp Gly Asn Gin Leu Thr Leu Glu Glu 
370 375 380 



Arg Arg Asn Asn Val Val lie Thr Val Thr Gin Arg Asn Tyr Thr Glu 
385 390 395 400 



Tyr Trp Ser Gly Ser Asn Ser Gly Asn Gin Lys Met Glu Ala Val Gin 
405 410 415 



Lys lie Asn Tyr Thr Val Pro Gin Ser Gly Thr Phe Lys lie Glu Phe 
420 425 430 



Pro lie Leu Glu Asp Ser Ser Glu Leu Gin Leu Lys Ala Tyr Phe Leu 
435 440 445. 



Gly Ser Lys Ser Ser Met Ala Val His Ser Leu Phe Lys Ser Pro Ser 
450 455 460 



Lys Thr Tyr lie Gin Leu Lys Thr Arg Asp Glu Asn lie Lys Val Gly 
465 470 475 480 



Ser Pro Phe Glu Leu Val Val Ser Gly Asn Lys Arg Leu Lys Glu Leu 
485 490 495 



Ser Tyr Met Val Val Ser Arg Gly Gin Leu Val Ala Val Gly Lys Gin 
500 505 510 



Asn Ser Thr Met Phe Ser Leu Thr Pro Glu Asn Ser Trp Thr Pro Lys 
515 520 525 
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Ala Cys Val He Val Tyr Tyr He Glu Asp Asp Gly Glu He He Ser 
530 535 540 



Asp Val Leu Lys He Pro Val Gin Leu Val Phe Lys Asn Lys He Lys 
545 550 555 560 



Leu Tyr Trp Ser Lys Val Lys Ala Glu Pro Ser Glu Lys Val Ser Leu 
565 " 570 * 575 



Arg He Ser Val Thr Gin Pro Asp Ser He Val Gly He Val Ala Val 
580 585 590 



Asp Lys Ser Val Asn Leu Met Asn Ala Ser Asn Asp He Thr Met Glu 
595 600 605 



Asn Val Val His Glu Leu Glu Leu Tyr Asn Thr Gly Tyr Tyr Leu Gly 
610 615 620 



Met Phe Met Asn Ser Phe Ala Val Phe Gin Glu Cys Gly Leu Trp Val 
625 630 635 -640 



Leu Thr Asp Ala Asn Leu Thr Lys Asp Tyr He Asp Gly Val Tyr Asp 
645 650 655 



Asn Ala Glu Tyr Ala Glu Arg Phe Met Glu Glu Asn Glu Gly His He 
660 665 670 



Val Asp He His Asp Phe Ser Leu Gly Ser Ser Pro His Val Arg Lys 
675 680 685 



His Phe Pro Glu Thr Trp He Trp Leu Asp Thr Asn Met Gly Ser Arg 
690 695 700 



He Tyr Gin Glu Phe Glu Val Thr Val Pro Asp Ser He Thr Ser Trp 
705 710 715 720 



Val Ala Thr Gly Phe Val He Ser Glu Asp Leu Gly Leu Gly Leu Thr 
725 730 735 



Thr Thr Pro Val Glu Leu Gin Ala Phe Gin Pro Phe Phe He Phe Leu 
740 745 750 



Asn Leu Pro Tyr Ser Val He Arg Gly Glu Glu Phe Ala Leu Glu He 
755 760 765 



Thr He Phe Asn Tyr Leu Lys Asp Ala Thr Glu Val Lys Val He He 
770 775 780 
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Glu Lys Ser Asp Lys Phe Asp lie Leu Met Thr Ser Asn Glu lie Asn 
785 790 795 800 



Ala Thr Gly His Gin Gin Thr Leu Leu Val Pro Ser Glu Asp Gly Ala 
805 810 815 



Thr Val Leu Phe Pro lie Arg Pro Thr His Leu Gly Glu lie Pro lie 
820 825 830 



Thr Val Thr Ala Leu Ser Pro Thr Ala Ser Asp Ala Val Thr Gin Met 
835 840 845 



lie Leu Val Lys Ala Glu Gly lie Glu Lys Ser Tyr Ser Gin Ser lie 
850 855 860 



Leu Leu Asp Leu Thr Asp Asn Arg Leu Gin Ser Thr Leu Lys Thr Leu 
865 870 875 880 



Ser Phe Ser Phe Pro Pro Asn Thr Val Thr Gly Ser Glu Arg Val Gin 
885 890 895 



lie Thr Ala lie Gly Asp Val Leu Gly Pro Ser He Asn Gly Leu Ala 
900 905 910 



Ser Leu He Arg Met Pro Tyr Gly Cys Gly Glu Gin Asn Met He Asn 
915 920 925 



Phe Ala Pro Asn He Tyr He Leu Asp Tyr Leu Thr Lys Lys Lys Gin 
930 935 940 



Leu Thr Asp Asn Leu Lys Glu Lys Ala Leu Ser Phe Met Arg Gin Gly 
945 950 955 960 



Tyr Gin Arg Glu Leu Leu Tyr Gin Arg Glu Asp Gly Ser Phe Ser Ala 
965 970 975 



Phe Gly Asn Tyr Asp Pro Ser Gly Ser Thr Trp Leu Ser Ala Phe Val 
980 985 990 



Leu Arg Cys Phe Leu Glu Ala Asp Pro Tyr He Asp He Asp Gin Asn 
995 1000 1005 



Val Leu His Arg Thr Tyr Thr Trp Leu Lys Gly His Gin Lys Ser 
1010 1015 1020 



Asn Gly Glu Phe Trp Asp Pro Gly Arg Val He His Ser Glu Leu 
1025 1030 1035 
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Gin Gly Gly Asn Lys Ser Pro Val Thr Leu Thr Ala Tyr He Val 
1040 " 1045 1050 



Thr Ser Leu Leu Gly Tyr Arg Lys Tyr Gin Pro Asn He Asp Val 
1055 1060 1065 



Gin Glu Ser He His Phe Leu Glu Ser Glu Phe Ser Arg Gly He 
1070 1075 1080 



Ser Asp Asn Tyr Thr Leu Ala Leu He Thr Tyr Ala Leu Ser Ser 
1085 1090 1095 



Val Gly Ser Pro Lys Ala Lys Glu Ala Leu Asn Met . Leu Thr Trp 
1100 1105 1110 



Arg Ala Glu Gin Glu Gly Gly Met Gin Phe Trp Val Ser Ser Glu 
1115 1120 1125 



Ser Lys Leu Ser Asp Ser Trp Gin Pro Arg Ser Leu Asp He Glu 
1130 1135 1140 



Val Ala Ala Tyr Ala Leu Leu Ser His Phe Leu Gin Phe Gin Thr 
1145 1150 1155 



Ser Glu Gly He Pro He Met Arg Trp Leu Ser Arg Gin Arg Asn 
1160 " 1165 1170 



Ser Leu Gly Gly Phe Ala Ser Thr Gin Asp Thr Thr Val Ala Leu 
1175 1180 1185 



Lys Ala Leu Ser Glu Phe Ala Ala Leu Met Asn Thr Glu Arg Thr 
1190 1195 1200 



Asn He Gin Val Thr Val Thr Gly Pro Ser Ser Pro Ser Pro Val 
1205 1210 1215 



Lys Phe Leu He Asp Thr His Asn Arg Leu Leu Leu Gin Thr Ala 
1220 1225 1230 



Glu Leu Ala Val Val Gin Pro Met Ala Val Asn He Ser Ala Asn 
1235 1240 1245 



Gly Phe Gly Phe Ala He Cys Gin Leu Asn Val Val Tyr Asn Val 
1250 1255 1260 



Lys Ala Ser Gly Ser Ser Arg Arg Arg Arg Ser He Gin Asn Gin 
1265 1270 1275 
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Glu Ala Phe ' Asp Leu Asp Val Ala Val Lys Glu Asn Lys Asp Asp 
1280 1285 1290 



Leu Asn His Val Asp Leu Asn Val Cys Thr Ser Phe Ser Gly Pro 
1295 1300 1305 



Gly Arg Ser Gly Met Ala Leu Met Glu Val Asn Leu Leu Ser Gly 
1310 1315 1320 



Phe Met Val Pro Ser Glu Ala lie Ser Leu Ser Glu Thr Val Lys 
1325 1330 1335 



Lys Val Glu Tyr Asp His Gly Lys Leu Asn Leu Tyr Leu Asp Ser 
1340 1345 1350 



Val Asn Glu Thr Gin Phe Cys Val Asn lie Pro Ala Val Arg Asn 
1355 1360 1365 



Phe Lys Val Ser Asn Thr Gin Asp Ala Ser Val Ser lie Val Asp 
1370 1375 1380 



Tyr Tyr Glu Pro Arg Arg Gin Ala Val Arg Ser Tyr Asn Ser Glu 
1385 1390 1395 



Val Lys Leu Ser Ser Cys Asp Leu Cys Ser Asp Val Gin Gly Cys 
1400 1405 1410 



Arg Pro Cys Glu Asp Gly Ala Ser Gly Ser His His His Ser Ser 
1415 1420 1425 



Val lie Phe lie Phe Cys Phe Lys Leu Leu Tyr Phe Met Glu Leu 
1430 1435 1440 



Trp Leu 
1445 



<210> 5 

<211> 2608 

<212> DNA 

<213> HUMAN 



<220> 

<221> Intron 
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<220> 

<221> exon 

<222> (952) . . (1069) 

<220> 

<221> Intron 

<222> (1070) . . (2608) 



<400> 5 

gtaaaaattt ataaagttct ttgcccatac atattttgtt tagtgtttgt tttaaataag 60 

ctttgcccgc tttctaatgt ttaagtacaa acatagtgta actaagaact aagtagacca 120 

aaaggatttt ttaggaaatg atatttattg aatctaaata cagtttttga taaagccaca 180 

cataaattat ggcaggaagg tctcatcaat gagaagatag gccttttttt ttttttttaa 240 

ctgaagggtg attttgactt ccttgaagtc tcatgattct tgttgaagaa aaattgctgg 300 

gagtacattt gttgtcacag gatgggaagc actcatgatt acctcctgtg acccctggca 360 

gtgctgctaa ctgaaccctg ctcctcacaa agcattccca ggagtcacag ggagaagggg 420 

catgggtggt ggaaagaatt cagcttggct gataaacccc gtaccacctg gcctgataat 480 

tgagcaggta aatcatgaaa tccacatagt attttatagt cagctgttta aagatacttg 540 

agttaacaca tgagtgaaat ctcaaggaaa caaataacag cattgacagg gatacagaga 600 

aaaacttctg caaatttaga gaaaaaattg gagttaagtt tgaaaatgtg tatttattat 660 

ctataaaaaa tttgtgaaaa aataatgttt attctgaaga tgtaaatttt gcaggaagat 720 

tttattagaa tatggatcaa tatgcagtat tatgacctta tgatgaccta ttctttgaaa 780 

agttgggatt tactgtttta tacttaaacc ttttaaatgg ttttaaattc agatatgtaa 840 

acaataggaa aaattgaaat tcttccaaaa atagtttaga ttattttggc ttatttcaaa 900 

atgtatcagt tcttggtttt gtgatgttta tatttattat ctt.gacttca gt tac agg 958 % 

Tyr Arg 
1 

att tac caa gaa ttt gaa gta act gta cct gat tct ate act tct tgg 1006 
lie Tyr Gin Glu Phe Glu Val Thr Val Pro Asp Ser lie Thr Ser Trp 
5 10 15 

gtg get act ggt ttt gtg ate tct gag gac ctg ggt ctt gga eta aca 1054 
Val Ala Thr Gly Phe Val lie Ser Glu Asp Leu Gly Leu Gly Leu Thr 
20 25 30 

act act cca gtg gag gtattgtatt aaagagctgc ttatcagtat tacggtgaca 1109 
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Thr Thr Pro Val Glu 
35 

ttaagctaat acagcgtcag 
tcacagttta gagattcctt 
tgtgagctat atatagctat 
tagaaaagtt tattattgac 
aaaagagaga gagagaaaga 
caagaatgaa cagaacttct 
caaattgcat catatattgt 
gagtggcctg aattttctca 
attcatgaaa aatacagtgt 
agattgtaag ctcaggccac 
attcaagaat cattaggaac 
gcagactgtc acccactaac 
ccttcacaaa tgtgcagata 
ctaaactaac ccaggtttag 
ttggccaaca atatcagaat 
cacattttct catctgtatg 
tttcctcttt aaatttagtc 
gctgttggat ttggcagggc 
gagcaaattg cttcgggatt 
caatatgaat ttgtactagt 
aggagactaa cagtaacctt 
gtttggttgc tataaggcac 
caagtcaatt tgccgtcatt 
gttgacattt ctagctgaag 
cataaaatgt gaagaatttt 

<210> 6 

<211> 23 

<212> DNA 

<213> HUMAN 
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ctcctcaatt 


ttttttttaa 


atgactgctt 


ataatgttta 


1169 


ggctttgtct 


ttaggttttt 


atctgtttta 


tatttaagaa 


1229 


ataaactgct 


aaatgtgcaa 


agtccgtatt 


aagatttggg 


1289 


ctgaactaac 


catctccaaa 


ggccagaaga 


gagagaaaga 


1349 


ggagaggaga 


gagagagtga 


gtctttctat 


ttgtcctctt 


1409 


caagatgttc 


cctagccaat 


attccatcat 


gtcttttggt 


1469 


ttcctaagcc 


agtcactggc 


aggaggaata 


taatgaccat 


1529 


tttgaaattg 


aaatgtaatt 


ttgatttaca 


aaataatcgt 


1589 


agattgaaaa 


atgctttggg 


tttatataga 


aattggaatt 


1649 


tataaacaga 


caattcagca 


acatgaatgt 


ctgaagggac 


1709 


atggggcaat 


ttttcattgt 


ctggggctgt 


cctgagtatt 


1769 


tacctatagc 


accttcgagt 


catggtgaca 


atctaagaca 


1829 


aactctagag 


ggagttactg 


ctgccagcaa 


aaccactggc 


1889 


ctttagatgc 


aggtgtgggg 


cttggccttt 


tctgtaggac 


1949 


tgggtcactg 


aggaggaagc 


acatgtattc 


agatgtccca 


2009 


taaaaataaa 


tcatatatat 


gttttagaaa 


taatttccaa 


2069 


aggaagcaca 


tgtattcaga 


tgtcccacac 


tagaacaggg 


2129 


ttttaaagca 


gattggtgga 


gtcaatacag 


catgaaagaa 


2189 


agacaggctg 


ggttctagtt 


ctggctcctc 


tacttgccag 


2249 


tacaaaaatc 


tcaaaaattt 


aattttcttt 


tctataaact 


2309 


atggggttgt 


aataaccaaa 


caaaataatt 


tatgtgaagt 


2369 


ttaataaagt 


atagcaatta 


ttatgttaag 


taacataaat 


2429 


catttgtgat 


aagttgctgt 


ttgctttctg 


ttgatagcaa 


2489 


ttaaaagctt 


cacaggtttt 


ataaagattg 


catttaattg 


2549 


gacctgaata 


aaaatatgta 


ctcgttgtgt 


tctttccag 


2608 
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<400> 6 

ttcaaattct tggtaaatcc tgt 



23 



<210> 7 

<211> 23 

<212> DNA 

<213> HUMAN 

<400> 7 

ttcaaattct tggtaaatcc tgg 23 

<210> 8 

<211> 22 

<212> DNA 

<213> HUMAN 



<210> 9 

<211> 18 

<212> DNA 

<213> HUMAN 

<400> 9 

tttagattat tttggctt 18 

<210> 10 

<211> 18 

<212> DNA 

<213> HUMAN 



<400> 8 

atgaccttat gatgacctat tc 



22 



<400> 10 

atggttagtt caggtcaa 



18 



<210> 11 



<211> 27 
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<212> DNA 
<213> HUMAN 

<400> 11 

tgtatcagtt cttggttttg tgatgtt 

<210> 12 

<211> 30 

<212> DNA 

<213> HUMAN 

<400> 12 

ccaagaagtg atagaatcag gtacagttac 

<210> 13 

<211> 40 

<212> DNA * 

<213> HUMAN . 

<400> 13 . 

tattatcttg acttcagtta caggatttac caagaatttg 

<210> 14 

<211> 37' 

<212> DNA 

<213> HUMAN 

<400> 14 

tattatcttg acttcagttc caggatttac caagaat 

<210> 15 

<211> 18 

<212> DNA 

<213> synthetic construct 
<220> 
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<221> primer 

<222> (1)..(18) 

<400> 15 

gtagcccagg cagacgcc 

<210> 16 

<211> 20 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1) . . (20) 

<400> 16 

cgcattgtta cactcttctc 

<210> 17 

<211> 19 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1) . . (19) 

<400> 17 

gattcttcaa atggacttt 

<210> 18 

<211> 21 

<212> DNA 

<213> synthetic construct 
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18 



20 
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<220> 



<221> 



primer 



<222> 



(1)..(21) 



<400> 18 

tgaattccca atcctggagg a 



21 



<210> 19 

<211> 20 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1) . . (20) 

<400> 19 

ttcaggaatg tggactctgg 20 

<210> 20 

<211> 20 

<212>. DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1)..(20) 



<400> 20 

ctgggagcac ttggttgtca 



20 



<210> 21 



<211> 20 



<212> DNA 



<213> synthetic construct 
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<220> 

<221> primer 

<222> (1)..(20) 

<400> 21 

acaatttcag acttctgagg 

<210> 22 

<211> 21 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1)..(21) 

<400> 22 

gacgaagatc tatccaaaat c 

<210> 23 

<211> 22 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1)..(22) 

<400> 23 

gtgacaacca ctgttggatc aa 

<210> 24 

<211> 20 

<212> DNA' 
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20 



21 
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<213> synthetic construct 

<220> 

<221> primer 

<222> (1)..{20) 

<400> 24 

tacatttctt gaaatacctg 

<210> 25 

<211> 18 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1)..(18) 

<400> 25 

ggctgtgtca cagagatc 

<210> 26 

<211> 20 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1)..(20) 

<400> 26 

gccacccaag aagtgataga 

<210> 27 

<211> 19 
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20 



18 



55 



WO 02/070738 



PCT/CA02/00291 



<212> 



DNA 



<213> 



synthetic construct 



<220> 



<221> 



primer 



<222> 



(1)..(19) 



<400> 27 

cggcttcaag gaaacatct 



19 . 



<210> 28 

<211> 22 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1) . . (22) 

<400> 28 

cagcaacatc taaatcaaag gc 22 

<210> 29 

<211> 18 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1)..(18) 



<400> 29 

cacagccaaa gttccata 



18 



<210> 30 
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<211> 20 

<212> DNA 

<213> synthetic construct 
<220> 

<221> primer 

<222> (1)..(20) 

<400> 30 

gctaggacct gttgtacacc 
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20 
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We claim: 

1. An oligonucleotide comprising a sequence which binds specifically to (i) a region of CD109 
nucleic acid that includes a single nucleotide polymorphism that is distinctive of a Gov 3 allele 
and/or (ii) a region of CD109 nucleic acid that includes a single nucleotide polymprphism that is 
distinctive of a Gov 5 allele. 

2. The oligonucleotide of claim 1, comprising 8 to 50 nucleotides. 

3. The oligonucleotide of claim 1 , wherein the nucleic acid specifically binds to one of (i) or (ii) 
under high stringency hybridization conditions. 

4. The oligonucleotide of claim 2, wherein the stringent hybridization conditions comprise 
0.1XSSC, 0.1 % SDS at 65°C. 

5. The oligonucleotide of claim. 1, wherein the CD109 nucleic acid comprises genomic DNA, 
cDNA, or RNA corresponding to the Gov 8 allele of the CD109 gene or locus, or comprises 
genomic DNA, cDNA, or RNA corresponding to the Gov 5 allele of the CD109 gene or locus. 

6. The oligonucleotide of claim 5, wherein the Gov 3 allele comprises an A at a position 
corresponding to position 2108 of SEQ ID NO:1 and corresponding to position 954 of SEQ ID 
NO:5. 

7. The oligonucleotide of claim 5, wherein the Gov 5 allele comprises a C at a position 
corresponding to position 2108 of SEQ ID NO:3 and corresponding to position 954 of SEQ ID 
NO:5. 

8. The oligonucleotide of claim 1, comprising a sequence complementary to the Gov 3 allele or 
to the Gov 5 allele. 

9. The oligonucleotide of claim 6, comprising a sequence selected from the group consisting of: 

(a) 8-50 nucleotides of SEQ ID NO:1 ; 

(b) a sequence that is complementary to a sequence specified in (a); and 

(c) a sequence having at least 70% sequence identity to a sequence in (a) or (b), wherein the 
sequence having identity is capable of hybridization to CD109 under high stringency hybridization 
conditions. 
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10. The oligonucleotide of claim 7, comprising a sequence selected from the group consisting 
of: 

(a) 8-50 nucleotides of SEQ ID NO:3; 

(b) a sequence that is complementary to a sequence specified in (a); and 

(c) a sequence having at least 70% sequence identity to a sequence in (a) or (b), wherein the 
sequence having identity is capable of hybridization to CD1 09 under high stringency hybridization 
conditions. 

11. A oligonucleotide comprising all or part of any one of SEQ ID NO:6-SEQ ID NO:14 or a 
complement thereof. 

12. The oligonucleotide of claim 11, comprising 8 to 50 nucleic acids. 

13. The oligonucleotide of claim 1, wherein the nucleic acid is capable of use as a probe in a 
hybridization assay. 

14. The oligonucleotide of claim 1, wherein the nucleic acid sequence is detectably labelled. 

15. The oligonucleotide of claim 14, wherein the detectable label comprises: 

(a) a fluorogenic dye; and/or 

(b) a biotinylation modification; and/or 

(c) a radiolabel. 

1 6. The oligonucleotide of claim 1 , wherein the sequence comprises DNA, a DNA analog, RNA 
or an RNA analog. 

17. The oligonucleotide of claim 1 , wherein the oligonucleotide is attached to a substrate. 

18. The oligonucleotide of claim 1, wherein the oligonucleotide is capable of use as a primer that 
will specifically bind proximate to, and/or cause elongation through, a CD 109 sequence, including 
the single nucleotide polymorphism distinctive of the Gov 3 or Gov 1 ' alleles. 

19. A Gov genotyping kit comprising a detection agent for detecting the presence of a Gov allele- 
specific target sequence in a CD109 nucleic acid derived from a subject 



59 



WO 02/070738 



PCT/CA02/00291 



20. The kit of claim 1 9, wherein the detection agent comprises a nucleic acid and/or a restriction 
enzyme. 

21 . The kit of claim 1 9, further comprising a container. 

22. The kit of claim 21, wherein the container comprises a biological sample container for 
housing the detection agent. 

23. The kit of claim 19, further comprising a plate having a plurality of wells and' having bound 
thereto probes having a nucleic acid sequence which specifically binds to a CD109 sequence 
including a Gov 8 or a Gov 6 allele target sequence. 

24. The kit of claim 20, wherein the restriction enzyme is selected from the group consisting of 
Bst2UI, BstNl, BstOI, EcoRII, Maelll, MspR91, Mval, ScrFI or an isoschizomer thereof. 

25. The kit of claim 1 9, further comprising an amplification agent for amplifying the nucleic acid. 

26. The kit of claim 25, wherein the amplification agent amplifies a region of CD109 platelet, T 
cell, or endothelial cell mRNA including the single nucleotide polymorphism distinctive of a Gov 3 
or Gov 5 allele. 

27. The kit of claim 25, wherein the amplification agent comprises a primer set including first and 
second primers, wherein the first primer is a nucleic acid that will specifically bind proximate to, 
and/or cause elongation through, CD109 sequence that includes the single nucleotide 
polymorphism distinctive of a Gov 8 allele and the second primer is a nucleic acid that will 
specifically bind proximate to, and/or cause elongation through, CD109 sequence that includes 
the single nucleotide polymorphism distinctive of a Gov 6 allele. 

28. The kit of claim 19, wherein the nucleic acid is obtained by amplification with all or part of the 
nucleic acid of any one of SEQ ID NO:6 - SEQ ID NO:14 or the complement thereof. 

29. The kit of claim 1 9, further comprising all or part of a CD1 09 gene, a CD1 09-encoding mRNA, 
or a CD109 cDNA made from a CD1 09-encoding mRNA. 

30. The kit of claim 19, comprising the oligonucleotide of any of claims 1 to 18. 
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31 . The kit of claim 19, for detecting that the subject has or is at risk of a disease, disorder or 
abnormal physical state. 

32. The kit of claim 31 , wherein the disease, disorder or abnormal physical state comprises a 
blood disease, disorder or abnormal physical state. 

33. The kit of claim 32, wherein the blood disease, disorder or abnormal physical state comprises 
bleeding of the subject, or increased risk of bleeding, due to destruction of blood platelets. 

34. The kit of claim 33, wherein the blood disease, disorder or abnormal physical state comprises 
post-transfusion purpura ("FTP"), post-transfusion platelet refractoriness ("PR") or neonatal 
alloimmune thrombocytopenia ("NAIF). 

35. The kit of claim 33, wherein the nucleic acid is obtained from mRNA from human platelets, 
T cells, endothelial cells, or human genomic DNA. 

36. A method of Gov alloantigen genotyping a subject comprising: 

(a) providing a CD109 nucleic acid sample derived from the subject and 

(b) detecting a region of CD109 nucleic acid that includes a single nucleotide polymorphism 
distinctive of a Gov 8 or a Gov 5 allele. 

37. The method of claim 36 comprising determining whether the subject is homozygous or 
heterozygous for the Gov alleles. 

38. The method of claim 37, wherein the subject is a human and the Gov genotype is used to 
determine that the subject has, or is at risk of a disease, disorder or abnormal physical state. 

39. The method of claim 38, wherein the disease, disorder or abnormal physical state comprises 
a blood disease, disorder or abnormal physical state. 

40. The method of claim 39, wherein the blood disease, disorder or abnormal physical state 
comprises bleeding of the subject, or increased risk of bleeding, due to destruction of blood 
platelets. 
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41. The method of claim 40, wherein the blood disease, disorder or abnormal physical state 
comprises post-transfusion purpura ("PTP"), post-transfusion platelet refractoriness ("PR") or 
neonatal alloimmune thrombocytopenia ("NAIT"). 

42. The method of claim 41 , wherein the nucleic acid is obtained by amplifying the nucleic acid 
from the subject 

43. The method of claim 42, wherein the nucleic acid is obtained by amplification with all or part 
of the oligonucleotide of any of claims 1 to 1 8. 

44. The method of claim 41, wherein the nucleic acid is obtained from mRNA from human 
platelets, T cells, endothelial cells, or human genomic DNA. 

45. The method of claim 36, wherein the detection step comprises determining the nucleotide 
sequence of the CD109 nucleic acid. 

46. The method of clam 36, wherein the detection step comprises contacting the nucleic acid with 
the oligonucleotide of any of claims 1 to 18 under high stringency conditions. 

47. The method of claim 46, wherein the oligonucleotide will selectively hybridize to (i) a region 
of CD109 nucleic acid that includes a single polymorphism distinctive of a Gov 2 allele or (ii) a 
region of CD 109 nucleic acid that includes a single polymorphism distinctive of a Gov 6 allele. 

48. The method of claim 36, wherein the detecting step comprises: 

(a) performing a restriction endonuclease digestion of the nucleic acid, thereby providing a 
nucleic acid digest; and 

(b) contacting the digest with the oligonucleotide. 

49. The method of claim 47, wherein the hybridization occurs either during or subsequent to PCR 
amplification and the analysis is by "Real-Time" PCR analysis, or fluorimetric analysis. 

50. The method of claim 36, wherein the detection step comprises: 

(a) incubation of the amplified nucleic acid with a restriction endonuclease under conditions 
whereby the DNA will be cleaved if the nucleic acid comprises a recognition site for the enzyme; 
and 
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(b) determining whether the nucleic acid contains a recognition site for th restriction enzyme 
characteristic of cDNA made from mRNA encoding a Gov 3 or Gov 5 allele of CD109. 

51. The method of claim 50, wherein the restriction enzyme is selected from the group consisting 
of Bst2UI, BstNl, BstOI, EcoRII, Maelll, MspR91, Mval, ScrFI or an isoschizomer thereof. 

52. The method of claim 50, wherein the determination step includes size analysis of the nucleic 
acid. 

53. The method of claim 50, wherein the amplified nucleic acid is analyzed by electrophoretic 
mobility and the mobility of the amplified nucleic acid is compared to the characteristic mobility 
of amplified nucleic acid fragments corresponding to the Gov 3 or Gov 5 alleles of CD109. 

54. A method of amplifying CD109 mRNA comprising amplifying the mRNA by PGR using an 
oligonucleotide of any of claims 1 to 1 8. 

55. A Gov 3 specific antibody. 

56. The antibody of claim 55, that recognizes specifically a Gov 3 allele-specific CD109 epitope 
corresponding to the polypeptide encoded by a CD109 nucleic acid containing an A at the 
position corresponding to position 2108 of SEQ ID NO:1 and position 954 of SEQ ID NO:5., and 
containing the amino acid Tyrosine at the position corresponding to position 703 of the CD109 
protein encoded by SEQ ID NO:1 . 

57. A Gov 5 specific antibody. 

58. The antibody of claim 57, that recognizes specifically a Gov 5 allele-specific CD109 epitope 
corresponding to the polypeptide encoded by a CD109 nucleic acid containing a C at the position 
corresponding to position 2108 of SEQ ID NO:3 and position 954 of SEQ ID NO:5., and 
containing the amino acid Serine at the position corresponding to position 703 of the CD109 
protein encoded by SEQ ID NO:3. 

59. The antibody of any of claims 55 to 58, further comprising a monoclonal antibody or a 
polyclonal antibody. 

60. The antibody of any of claims 55 to 59, further comprising a detectable label. 
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61. An immunogenic composition comprising a Gov specific antibody of any of claims 55 to 60. 

62. A method of Gov alloantigen phenotyping a subject, comprising: 

(a) providing a CD109 polypeptide sample derived from the subject; and 

(b) detecting the presence of a Gov 3 or a Gov 6 antigen in the CD109 polypeptide. 

63. The method of claim 62, wherein the CD109 is membrane bound CD109 or isolated CD109. 

64. The method of claim 62, wherein the detection step comprises contacting the polypeptide 
sample with the antibody of any of claims 55 to 60. 

65. A diagnostic kit for Gov alloantigen phenotyping a subject, comprising a Gov 3 antibody and/or 
a Gov 6 antibody of any of claims 55 to 60. 

66. The kit of claim 65, further comprising a container. 

67. An isolated polypeptide containing Gov 3 allele-specific amino acid sequence and which is 
specifically reactive with a Gov 3 antibody. 

68. An isolated polypeptide containing Gov 5 allele-specific amino acid sequence and which is 
specifically reactive with a Gov 1 ' antibody. 

69. The isolated polypeptide of claim 67 or 68, comprising between 4 and 100 amino acids. 

70. The isolated polypeptide of claim 67, 68, or 69, comprising a full-length CD109 polypeptide, 
or a fragment of a CD1 09 polypeptide. 

71. An isolated CD109 polypeptide fragment, comprising a Gov 3 or a Gov 5 antigen. 

72. The polypeptide fragment of claim 71 , comprising all of, or a fragment of, the protein encoded 
by SEQ ID NO:1., and in which the amino acid corresponding to position 703 of the protein 
encoded by SEQ ID NO:1 is a Tyrosine 
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73. The polypeptide fragment of claim 72, comprising all of, or a fragment of, the protein encoded 
by SEQ ID NO:3 M and in which the amino acid corresponding to position 703 of the protein 
encoded by SEQ ID NO:3 is a Serine. 

74. The polypeptide fragment of claim 72 or 73, comprising between 4 and 100 amino acids. 

75. The polypeptide fragment of claim 74, comprising between 7 and 50 amino acids. 

76. The polypeptide fragment of claims 67 to 75, in which the polypeptide is purified from native 
CD109, is synthetic, or is prepared by recombinant means. 

77. The polypeptide fragment of claims 67 to 76, in which the polypeptide is bound to a substrate. 

78. A fusion compound comprising the polypeptide of any of claims 67 to 77 connected to an 
immunogenic carrier. 

79. The fusion compound of claim 78, wherein the immunogenic earner comprises a 
proteinaceous carrier. 

80. The fusion compound of claim 79, wherein the immunogenic carrier comprises a detectable 
label. 

81. A Gov 8 or Gov 5 specific antibody recognizing the fusion compound of claim 80. 

82. An immunogenic composition comprising the polypeptide, polypeptide fragment or fusion 
compound of any of claims 67 to 80. 

83. A method of producing a Gov 3 or Gov* specific antibody, comprising contacting an animal 
with the immunogenic composition of claim 82, so that the animal produces antibodies against 
the immunogenic composition. 

84. The method of claim 83, wherein the animal is a bird or a mammal. 

85. A method of screening an antibody producing culture to determine whether the culture 
produces Gov 3 or Gov 5 specific antibody, comprising: 

(a) contacting a polypeptide of the invention with the culture; and 
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86. The method of claim 85, wherein the polypeptide comprises a detectable label. 

87. The method of claim 86, wherein the polypeptide is attached to a substrate. 

88. A method of purifying a Gov allele-specific antibody from a sample, comprising: 

(a) contacting a Gov allele-specific antibody with a polypeptide of the invention comprising a Gov 8 
or Gov 6 antigen, so that an antibody.polypeptide complex is formed; 

(b) separating the complex from the sample; and 

(c) next separating the antibody from the polypeptide. 

89. The method of claim 88, wherein the polypeptide is bound to a substrate. 

90. The method of claim 89, wherein the polypeptide comprises a detectable label. 

91. A method of purifying a Gov polypeptide from a sample, comprising: 

(a) contacting a Gov allele-specific antibody with a polypeptide of the invention containing a Gov 8 
or Gov 5 -specific epitope, so that an antibodyipolypeptide complex is formed; 

(b) separating the complex from the sample; and 

(c) next separating the antibody from the polypeptide. 

92. The method of claim 91 , wherein the antibody is bound to a substrate. 

93. The method of claim 92, wherein the antibody comprises a detectable label. 

94. A method of screening a subject sample to determine whether the sample contains Gov 8 or 
Gov 5 -specific antibodies, comprising: 

(a) contacting a polypeptide of the invention with the sample; and 

(b) detecting the presence or absence of Gov 8 or Gov 5 specific antibody. 

95. The method of claim 94, wherein the polypeptide comprises a detectable label. 

96. The method of claim 95, wherein the polypeptide is attached to a substrate 
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97. The method of claim 96, wherein the subject comprises a mother of a fetus or a newborn 
infant, or the fetus or newborn infant itself, and the presence of Gov 3 or Gov 6 -specific antibody 
indicates that the fetus or infant has, or is at risk of NAIT. 

98. The method of claim 97, wherein the presence of Gov 3 or Gov 5 specific antibody indicates 
that the subject has, or is at risk of a blood disease, disorder or abnormal physical state. 

99. The method of claim 98, wherein the blood disease, disorder or abnormal physical state 
comprises bleeding of the subject, or increased risk of bleeding, due to destruction of blood 
platelets. 

100. The method of claim 99, wherein the blood disease, disorder or abnormal physical state 
comprises post-transfusion purpura fPTP"), post-transfusion platelet refractoriness ("PR") or 
neonatal alloimmune thrombocytopenia (NAIT). 

101. The method of claim 100, wherein the sample comprises human serum or plasma. 

1 02. A diagnostic kit for detection of Gov 8 or Gov 5 specific antibody, comprising a polypeptide of 
any of claims 67 to 80. 

103. The kit of claim 102, further comprising a container. 

104. A method of determining Gov antibody specificity, comprising: 

(a) contacting an antibody with a first polypeptide comprising a Gov 3 antigen and a second 
polypeptide comprising a Gov 6 antigen; and 

(b) determining whether the antibody binds to either or both of the first and second polypeptide. 

105. A method of blocking Gov 3 antibody binding to an antigen, comprising: contacting the 
antibody with a polypeptide of the invention comprising a Gov 3 antigen so that an 
antibodypolypeptide complex is formed. 

106. The method of claim 105, wherein the polypeptide comprises a detectable label. 

107. The method of claim 106, wherein the polypeptide is bound to a substrate. 
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108. A method of blocking Gov 6 antibody binding to an antigen, comprising: contacting the 
antibody with a polypeptide of the invention comprising a Gov b antigen so that an 
antibodyipolypeptide complex is formed. 

109. The method of claim 108, wherein the polypeptide comprises a detectable label. 

110. The method of claim 109, wherein the polypeptide is bound to a substrate. 

1 1 1 . A pharmaceutical composition comprising the polypeptide of any of claims 67 to 80. 

1 12. A method of immunizing a subject so that the subject will produce anti-idiotypic 
antibodies, comprising administering to the subject the immunogenic composition of any of 
claims 82 to 84. 

1 13. A method of blocking Gov 8 or Gov 5 specific antibodies from binding to CD109 in a 
subject, comprising: administering to the subject polypeptides of the invention capable of 
binding to Gov 3 and/or Gov 6 specific antibodies. 

1 14. The method of claim 113, wherein the polypeptide comprises a detectable label. 

115. The method of claim 1 14, wherein the binding of the polypeptide to the Gov 3 or Gov 5 
specific antibody prevents alloimmune cell destruction by the antibody. 

116. The method of claim 115, wherein the binding of the polypeptide to the Gov 3 or Gov 5 - 
specific antibody depletes the antibody. 

1 17. The method of claim 1 1 6, wherein the subject has or is at risk of a blood disease, disorder 
or abnormal physical state. 

118. The method of claim 117, wherein the blood disease, disorder or abnormal physical state 
comprises bleeding of the subject, or increased risk of bleeding, due to alloimmune destruction 
of blood platelets. 

119. The method of claim 118, wherein the blood disease, disorder or abnormal physical 
state comprises post-transfusion purpura ("PTP"), post-transfusion platelet refractoriness 
fPR") or neonatal alloimmune thrombocytopenia ("NAIT"). 
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